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INTRODUCTORY NOTE 


The Tomash/American Institute of Physics series in the History of 
Modem Physics offers the opportunity to follow the evolution of physics 
from its classical period in the nineteenth century when it emerged as a 
distinct discipline, through the early decades of the twentieth century 
when its modem roots were established, into the middle years of this 
century when physicists continued to develop extraordinary theories 
and techniques. The one hundred and fifty years covered by the series, 
1800 to 1950, were crucial to all mankind not only because profound 
evolutionary advances occurred but also because some of these led to 
such applications as the release of nuclear energy. Our primary intent 
has been to choose a collection of historically important literature which 
would make this most significant period readily accessible. 

We believe that the history of physics is more than just the narrative 
of the development of theoretical concepts and experimental results: it is 
also about the physicists individually and as a group—how they pursued 
their separate tasks, their means of support and avenues of communica¬ 
tion, and how they interacted with other elements of their contemporary 
society. To express these interwoven themes we have identified and se¬ 
lected four types of works: reprints of “classics” no longer readily avail¬ 
able; original monographs and works of primary scholarship, some pre¬ 
viously only privately circulated, which warrant wider distribution; 
anthologies of important articles here collected in one place; and disser¬ 
tations, recently written, revised, and enhanced. Each book is prefaced 
by an introductory essay written by an acknowledged scholar, which, by 
placing the material in its historical context, makes the volume more 
valuable as a reference work. 

The books in the series are all noteworthy additions to the literature of 
the history of physics. They have been selected for their merit, distinc¬ 
tion, and uniqueness. We believe that they will be of interest not only to 
the advanced scholar in the history of physics, but to a much broader, 
less specialized group of readers who may wish to understand a science 
that has become a central force in society and an integral part of our 
twentieth-century culture. Taken in its entirety, the series will bring to 
the reader a comprehensive picture of this major discipline not readily 
achieved in any one work. Taken individually, the works selected will 
surely be enjoyed and valued in themselves. 




Max Jammer 
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Preface to the Second Edition 


The first edition of The Conceptual Development of Quantum Mechanics 
was published in 1966 by the McGraw-Hill Book Company in the Inter¬ 
national Series in Pure and Applied Physics. It was reprinted several times 
and was also available as a paperback, but it has been out-of-print since 
1972. Translations into foreign languages, among them a two-volume Jap¬ 
anese translation by Professor Koide Shoichiro, published in 1974 by 
Tokyo Tosho K. K., and a Russian translation by Academicians V. N. 
Pokzovskij and L. I. Ponomareva, published by Nauka, Moscow, have 
been available. 

The present volume, published in the History of Modem Physics: 
1800-1950 series by the American Institute of Physics in New York, is an 
enlarged and revised version of the first edition. The original text, with all 
of its footnotes, has been carefully checked and updated; new sections and 
a new appendix have been included, as well as an extensive bibliography 
which will enable the reader to pursue particular items of interest. 

These modifications and additions to the new edition do not, however, 
change the purpose for which this book was originally written. It should 
serve, first of all, as a companion volume to any course 6n quantum me¬ 
chanics, in order to provide a more profound understanding of the princi¬ 
ples of this theory by revealing the conceptual origins and background. In 
addition, it is also designed to offer a systematic, comprehensive and co¬ 
herent account of the historical development of quantum mechanics for 
the historian and philosopher of modem science. 

As stated in the preface to the first edition in 1966, I had hoped to 
continue this line of research with a sequel volume on the conceptual 
development of relativistic quantum mechanics and quantum field theo¬ 
ry. However, John Stewart Bell’s paper on hidden variables which ap¬ 
peared in the July 1966 issue of the Reviews of Modern Physics , together 
with his paper on the Einstein-Podolsky-Rosen paradox, threw new light 
on the interpretations of quantum mechanics. They initiated a develop¬ 
ment in which, among many others, the experimentalist John F. Clauser, 
who at that time attended my lectures at Columbia University in New 
York, and the theoretician Jeffrey Bub, with whom I had long discussions 
at the Minnesota Center for the Philosophy of Science in Minneapolis, 
were actively involved. Prompted by these developments, I wrote The 
Philosophy of Quantum Mechanics (Wiley-Interscience, New York, 
1974), which has been translated into Japanese (by Kinokuniya Shoten, 
Tokyo) and Chinese (by Kecheng Qin, Peking University, Beijing). 
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The reader may perhaps wonder why a book on the development of 
modem physics, dealing with historical issues that apparently are faits 
accomplis, should have to be revised and emended, especially as Werner 
Heisenberg and Paul Dirac had approved the final draft. The reason is, of 
course, that the development of quantum mechanics, said to have reached 
its apex about sixty years ago, is nevertheless still an unfinished business 
today. 

In fact, the conceptual revolution brought about by quantum mechan¬ 
ics is so radical and penetrating that any theoretical innovation discovered 
today is apt to produce a re-interpretation and re-evaluation of results 
obtained in the past. A good example is the 1935 Einstein-Podolsky- 
Rosen paradox which was presented at the end of the 1966 edition, but 
whose real significance became clear only through the above-mentioned 
work of Bell and his followers beginning in 1966. Just as new insights in 
present-day physics may change our views and interpretations of past 
developments so, conversely, as shown by the example of Dirac (men¬ 
tioned in the preface to the first edition), the study of the historical devel¬ 
opment of modem physics may have an impact on current research. One 
should never regard the history of science, and especially the history of 
quantum mechanics, as a closed file. As Johann Wolfgang von Goethe, in 
his writings on geology and mineralogy, stated, “The history of science is 
science itself.” 


Max Jammer 
July 1988 



Preface 


It is the purpose of this study to trace the conceptual development of 
quantum mechanics from its inception to its formulation as a full-fledged 
theory of atomic physics, from its status as a rather doubtful ad hoc hy¬ 
pothesis to that of an imposing intellectual structure of great beauty. 

All the great theories in the history of physics—from Aristotelian me¬ 
chanics and its medieval elaborations, through Newtonian dynamics with 
its Lagrangian or Hamiltonian modifications, to Maxwellian electrody¬ 
namics and Einsteinian relativity—have been subjected repeatedly to his- 
torico-critical investigations, and their conceptual foundations have been 
thoroughly analyzed. But no comprehensive scholarly study of the con¬ 
ceptual development of quantum mechanics has heretofore appeared. The 
popular or semiscientific publications available hardly skim the surface of 
the subject. And the few, though important, essays on the topic written by 
the originators of the theory themselves are mostly confined to a particu¬ 
lar aspect or to the defense of a specific philosophical position. The publi¬ 
cation of a comprehensive and coherent analysis of the conceptual devel¬ 
opment of quantum mechanics, the only consistent theory of atomic 
processes, and hence a foundation of modern science, seems therefore to 
fill an important lacuna in the literature on the history and philosophy of 
physics. 

Such a study, however, should not be regarded as an end in itself. Never 
before has a theory been treated in so many excellent texts within so short 
a period of time. Since every additional text, as is natural in a developing 
branch of science, attempts to construct the theory on the basis of an ever 
more concise logical structure, the traditional texts presenting the materi¬ 
al in the order of its historical development are gradually losing ground. 
Students, it is rightfully claimed, may be saved much trouble “if they are 
not led through all the historical pitfalls, and instead acquainted from the 
very beginning with concepts, such as the spin, that cannot be grasped 
except by quantum mechanical means.” 1 Admittedly, a good working 
knowledge, even a high standard of efficiency and competence in applying 
the theory to physical problems may be obtained by this method of in¬ 
struction. But as long as it is true that “however far the phenomena tran¬ 
scend the scope of classical physical explanation, the account of all evi¬ 
dence must be expressed in classical terms,” 2 a profound comprehension 
of quantum mechanics requires more than a study of the subject in its 
shortest possible logical formulation. “It is impossible to understand the 
methods of modem quantum mechanics without a knowledge of the way 
in which the theory has been developing.” 3 Indeed, some knowledge of 
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the dramatic struggle of ideas that preceded the formation of quantum 
conceptions and of the intricate ways of reasoning that led to the generally 
accepted formulation of the theory is indispensable for a profound com¬ 
prehension of its physical significance, for an intelligent appreciation of its 
philosophical implications, and even for an enlightened understanding of 
its logical structure. Duhem’s statement “faire Phistoire d’un principe 
physique, c’est en meme temps en fair l’analyse logique” is particularly 
meaningful for the science of quantum mechanics. 

The subject of our discussion may be of interest not only from the 
historical and methodological, but also from the purely scientific, point of 
view. If the fundamental conceptions of theoretical physics cannot be 
extracted from experience alone but are, as Einstein once said, “free in¬ 
ventions of the human intellect” 4 a study of their formation and its atten¬ 
dant empirical, mathematical, philosophical, and psychological factors 
may well throw some light on the questions of whether the choice of the 
actually accepted theoretical scheme was unique and of whether a selec¬ 
tion of other factors, under discussion at the time, could have led to an 
alternative formulation of the theory. 

Such conceptual and circumstantial inquiries, of course, do not ex¬ 
clude the possibility of an irrational residue in the process of scientific 
theory construction. If reductions of such irrationalities to their bare 
minima form the historical and methodological components of the con¬ 
tents of a study such as the present, considerations of the kind mentioned 
before which bring this study into relation with current research consti¬ 
tute its scientific component. 5 

By now, the reader will have understood that the present volume, 
which confines itself to nonrelativistic quantum mechanics of a finite 
number of degrees of freedom and which, it is hoped, will be followed by a 
similar treatment of relativistic quantum mechanics and quantum field 
theory, is neither a textbook nor a collection of biographical notes nor 
even a study of priority questions. In fact, even the strict chronological 
order of presenting the material has often been violated in favor of the 
logical coherence of the discussion. The primary objective of this book is 
the study of the problems of how empirical phenomena led to a clash with 
established principles of classical physics, how experimental research, 
combined with mathematical reasoning and philosophical thought, 
opened entirely novel perspectives, and how in the process of constructing 
the conceptual edifice of quantum mechanics each stage depended on 
those preceding it without necessarily following from them as a logical 
consequence. In comparison to these, all other considerations are deemed 
of secondary importance. 

Although various issues, in the development of quantum mechanics 
have been pursued, as the reader will see, to the very front line of current 
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research, the main theme of the study concludes with a discussion of the 
conceptual situation as brought about by the establishment of the so- 
called Copenhagen interpretation. As is well know, this interpretation is 
still epoused today by the majority of theoreticians and practicing physi¬ 
cists. Though not necessarily the only logically possible interpretation of 
quantum phenomena, it is de facto the only existing fully articulated con¬ 
sistent scheme of conceptions that brings order into an otherwise chaotic 
cluster of facts and makes it comprehensible. The exclusion of alternative 
interpretative attempts—which, it is hoped, will be discussed in another 
volume—should therefore not impair the logical completeness and cur¬ 
rent significance of the present study. 

The sources I have been using fall into three different categories. 

1. The main sources are, of course, the papers and monographs written 
by the originators of the theory. Full bibliographical references have been 
supplied in the present work so that the interested reader can easily check 
or pursue any issue involved. Since science consists of recorded verifiable 
or falsifiable general statements, these sources will have to serve as the 
ultimate basis for our discussion as far as the subject matter is concerned. 

2. Einstein once said: “If you want to find out anything from the theo¬ 
retical physicists about the methods they use, I advise you to stick closely 
to one principle: don’t listen to their words, fix your attention on their 
deeds.” For, he continued, “to the discoverer in this field the products of 
his imagination appear so necessary and natural that he regards them, and 
would have them regarded by others, not as creations of thought but as 
given realities.” 6 Generally speaking, Einstein was certainly right. With 
respect to our present subject, however, which deals with conceptual con¬ 
structs that on the whole defy all visualization or picturability and hence 
can hardly be called “necessary and natural,” the danger referred to can¬ 
not be serious. I felt therefore entitled to ignore this warning and discussed 
the subject with quite a number of prominent physicists who contributed 
decisively to the development of the theory. 

Appointments at the Institut Henri Poincare in Paris, at the Federal 
Institute of Technology (E. T.H. ) in Zurich, at the University of Gottingen , 
and at the Catholic University of America in Washington, D.C., gave me 
the opportunity to discuss various issues of the subject with Prince Louis 
de Broglie, Professors Max Bom, Werner Heisenberg, Paul A. M. Dirac, 
Friedrich Hund, Walter Heitler, B. L. Van der Waerden, Marcus Fierz, 
Res Jost, and to receive valuable suggestions from Sir George Thomson, 
Professors E. N. da C. Andrade, John C. Slater, Erwin Fues, and Franz 
Tank as well as from Professors Gerald Holton, Paul K. Feyerabend, and 
Thomas S. Kuhn. I am grateful to these distinguished physicists and phi¬ 
losophers of science for their stimulating comments. 



XVI 


Conceptual Development of Quantum Mechanics 


3. I wish to thank also the U.S. National Science Foundation for hav¬ 
ing granted me the opportunity to study thoroughly the Archive for the 
History of Quantum Physics deposited at the Library of the American 
Philosophical Society in Philadelphia, at the Library of the University of 
California in Berkeley, and at the Universitets Institut for Teoretisk Fysik 
in Copenhagen. The material gathered in this Archive contains numerous 
interviews, recorded in tape and transcript, with many physicists who 
contributed to the development of quantum mechanics. These interviews 
proved highly informative and reliable. However, since there was always 
the danger that the subject interviewed retrojected later conceptions or 
results into his recollections of earlier developments, the conclusions 
drawn from such interviews were critically compared with other sources 
of information whenever available. 

I am also indebted to Mrs. Wolfgang Pauli for her courtesy in arrang¬ 
ing for me to peruse the Pauli Collection in Zollikon-Zurich. 

Having soon realized that in quantum mechanics, though less ostensi¬ 
bly so than in relativity, almost every phase of the development bears the 
mark of Einstein’s ideas, I studied the documentary material of the Ein¬ 
stein Estate at the Institute for Advanced Study in Princeton, N. J. Ein¬ 
stein’s correspondence with the leading originators of quantum mechan¬ 
ics proved indeed most revealing. 

I am greatly indebted to Professor P. A. M. Dirac, who read parts of the 
manuscript when visiting Bar-Ilan University in the winter of 1965-1966. 

Finally, I wish to express my deep gratitude to Professor Werner Hei¬ 
senberg for the privilege he afforded me of discussing with him, at the 
Max-Planck-Institut fur Physik in Munich, various aspects of the work 
and for his kindness in critically reading the entire typescript of this book. 
Needless to say, the full responsibility for any errors or misinterpretations 
rests only upon me. 


Max Jammer 
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CHAPTER 1 

The Formation of Quantum 

Conceptions 


1.1 Unsolved Problems in Classical Physics 

Quantum theory, in its earliest formulation, had its origin in the inability 
of classical physics to account for the experimentally observed energy 
distribution in the continuous spectrum of black-body radiation. 

It was unfortunate that the first problem dealt with in the systematic 
development of quantum theory was that of the energy quantization of 
harmonic electromagnetic vibrations. There can be no doubt that much 
intellectual effort could have been saved on the part of the early quantum 
theoretists—as well as on the part of the reader of the present text—if a 
conceptually less involved issue had initiated the development of the theo¬ 
ry. 

This raises the question: Was it possible that other problems at issue at 
the time, if worked out consistently, could have brought about the same 
conceptual reorientation as did the problem of black-body radiation? And 
perhaps in a logically less complicated way? Consider, for example, the 
well-known irreconcilability with classical physics of the specific heat of 
solids at low temperatures, a problem whose solution was obtained de 
facto in terms of concepts formed in solving the black-body problem. One 
may conjecture how an independent and consistent solution of this prob¬ 
lem on specific heat would have influenced the progress of theoretical 
physics. It seems highly probable that in this hypothetical case energy 
quantization of material systems (atoms or molecules) would have pre¬ 
ceded that of waves and the approach to quantum theory would have been 
conceptually less difficult. 

It may even be argued that the energy quantization of atoms, at least so 
far as their ground state is concerned, could have been inferred without 
great difficulties from the empirical foundations of the classical kinetic 
theory. 1 Did not the fundamental observation that the specific heat per 
mole of a monatomic gas at constant volume is \R clearly indicate, if 
explained as it was in terms of the average kinetic energy \kT ' that any 
energy supplied to the gas can increase only the kinetic energy of the 
atoms and not the energy of the internal motions of their constituent parts, 
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whatever they were thought to be? Was it not common knowledge that in 
the process of atomic or molecular collisions the kinetic energy is con¬ 
served and none of it transformed into internally bound energy? (Today 
this fact is explained by saying that the kinetic energy of particles, which 
at room temperature is of the order of ^ eV, is much too small to excite the 
atom, its excitation energy being of the order of a few electron volts.) In 
retrospect it seems quite possible that a profound critical study of these 
and similar facts could have led to anticipating the idea of energy quanti¬ 
zation of atoms many decades before Bohr and thus to a more convenient 
approach to quantum-theoretic conceptions. For, as it has once been said, 
“research is to see what everybody has seen and to think what nobody has 
thought.” But post iacturam quis non sapitl 

There were, of course, reasons that quantum theory developed just the 
way it did. In order to understand fully how and why just the phenomenon 
of black-body radiation was to play such a decisive role in the process of 
physical concept formation, our analysis of the conceptual development 
of quantum mechanics will begin with a discussion on the foundations of 
the classical theory of thermal radiation. 

“A few weeks ago I had the honor of addressing the Academy with a 
memoir on some observations which seemed to me most interesting as 
they allow drawing conclusions on the chemical constitution of the solar 
atmosphere. Starting from these observations I have now derived, on the 
basis of rather simple theoretical considerations, a general theorem 
which, in view of its great importance, I allow myself to present to the 
Academy. It deals with a property of all bodies and refers to the emission 
and absorption of heat and light.” With these words KirchhofF began his 
classic paper “On the relation between emission and absorption of light 
and heat” 2 which he read before the Berlin Academy in 1859 and in which 
he showed that “for rays of the same wavelength at the same temperature 
the ratio of the emissive power to the absorptivity is the same for all 
bodies.” 3 To prove this theorem, which was later called “KirchhofF’s 
law,” KirchhofF calculated the equilibrium conditions for the radiation 
exchange between two emitting and absorbing infinite parallel plates fac¬ 
ing each other and covered on the outside surfaces by ideal reflectors. He 
assumed that one of the plates emits and absorbs radiation only of wave¬ 
length A whereas the other does so for all wavelengths. Since all radiation 
of wavelength different from A emitted by the second plate is finally reab¬ 
sorbed by the emitter after repeated reflections, the calculation could be 
restricted to the exchange of radiation of wavelength A. If E is the emissive 
power (i.e., the energy emitted at wavelength A per unit time) and A the 
absorptivity (i.e., the fraction of the incident radiation at wavelength A 
absorbed) of one plate and e and a those of the other, KirchofF showed 
that the latter absorbs the amount aE/(\—k) from E, where 
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lc= (1 — A)( \ — a) y and the amount a( 1 — A)e/( 1 — k) from e. Since 
at equilibrium the energy emitted equals the energy absorbed, 4 

aE a{ 1 —A)e 
e = -- 

1 — k l-k 

and hence 

a A 

which proved the theorem. A year later Kirchhoff published a second 
paper 5 in which he treated the same subject in greater detail and supplied a 
more rigorous analytical proof of the theorem. 

From the historical point of view it is interesting to note that it was the 
study of the Fraunhofer lines in the solar spectrum which led Kirchhoff to 
his investigations on the relation between emission and absorption. This is 
shown not only by his own introductory remarks to his first paper, as 
quoted above, but also by the fact that his second paper was part of a study 
entitled Investigations on the Solar Spectrum and the Spectra of Chemical 
Elements . 6 Because of its fundamental importance for the physics of radi¬ 
ation and for the subsequent development of theoretical physics, Kirch- 
hoff’s law was repeatedly subjected to critical examinations and was giv¬ 
en increasingly rigorous proofs. 7 

Simultaneously with Kirchhoff’s publications, Stewart 8 established 
independently the validity of the law on the basis of experimental mea¬ 
surements. Stewart’s paper, like Kirchhoff’s, was based on Prevost’s the¬ 
ory. The primary concern of the Scottish physicist, however, was an ex¬ 
perimental verification 9 of radiation relations, in contrast to that of 
Kirchhoff, who stressed the theoretical aspects involved. In fact, Kirch- 
hoff seems to have fully recognized the theoretical significance of the ratio 
E /A for he emphasized the importance of an exact experimental deter¬ 
mination of this function of temperature and expressed the hope that this 
would raise no special difficulties since “all functions encountered so far 
which are independent of the nature of bodies are of simple structure.” 

In the beginning of his second paper Kirchhoff introduced the notion 
of what he called “a perfectly black , or more briefly black body ,” 10 defin¬ 
ing it—just as it is defined today—as a body which absorbs all the radi¬ 
ation incident upon it. Thus the absorptivity A of a black body being unity, 
its emissive power E becomes identical with the aforementioned universal 
function. On the basis of simple thermodynamic considerations Kirchhoff 
then proceeded to show that cavity radiation, i.e., the radiation inside an 
enclosure of adiathermanous walls of equal temperature T, is of the same 
“quality and intensity” as the emissive power E of a black body 11 of the 
same temperature T. Kirchhoff’s cavity theorem, in modem terminol- 
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ogy, summarized the following situation. If B v = B( v,T) denotes the en¬ 
ergy emitted per unit frequency interval at v, per unit area, per unit solid 
angle, and per unit time in a direction normal to the surface of a black 
body 12 of temperature T so that for a surface obeying Lambert’s law the 
total emissive power over a hemisphere is E = ttSqB v cIv 9 and if u v 
= u(v,T) is the radiation density inside an enclosure of temperature T 
per unit frequency interval at v so that the total energy density is 
u = SoU v dv 9 then 

u v = — B v or u = — E (1.1) 

c c 

a relation which can be proved by simple geometric considerations. 13 The 
second law of thermodynamics implies that u v and u are independent of 
the nature of the walls of the enclosure. 

KirchhofF’s law, the fundamental theorem for the theoretical study of 
thermal radiation, was, as we have seen, historically viewed, the outcome 
of astrophysical or, more precisely, solar-physical investigations. The 
most important single device for the experimental study of thermal radi¬ 
ation owes its existence likewise to solar-physical investigations. For it 
was in the course of his study of the atmospheric absorption of solar 
radiation that S. P. Langley invented in 1880 the bolometer or, as he called 
it at first tentatively, the “actinic balance.” Prior to his appointment in 
1887 as Secretary of the Smithsonian Institution, Langley was professor at 
the Western University of Pennsylvania (today the University of Pitts¬ 
burgh) and in charge of the Allegheny Observatory at Allegheny City, 
which specialized in solar research. Dissatisfied with the available equip¬ 
ment and, in particular, with the thermopile, invented by Nobili and im¬ 
proved by Melloni and Tyndall, Langley, supported by a grant from the 
American Academy of Arts and Sciences, had been, during some time, 
working on the device and construction of an instrument more delicate 
and more prompt than the thermopile. 14 His bolometer, 15 which consisted 
of two blackened platinum strips, 0.5 mm wide and 0.002 mm thick, 
placed in opposite arms of a Wheatstone bridge, improved the sensitivity 
of radiation measurements by at least a factor of 10. The instrument was 
soon employed in spectroscopic research, but its importance for the study 
of black-body radiation was not realized until a convenient source for such 
radiation was made available. 

Although the idea that isothermal enclosures are black-body radiators 
had been suggested, at least implicitly, as early as 1884 by Christiansen, 16 
who stated that “holes” (“Locher”) act like “small completely black 
spots” (“kleine vollkommen schwarze Flecken”), and although Boltz¬ 
mann, with reference to Christiansen, applied this idea in the same year in 
a theoretical analysis of the relation between thermal radiation and ther- 
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modynamics, 17 it was only in 1895, on the basis of Kirchhoff’s cavity 
theorem, that Lummer and Wien constructed the first isothermal enclo¬ 
sure to serve as a source of radiation. 18 This device, combined with Lang¬ 
ley’s bolometer, facilitated considerably the systematic study of black- 
body radiation. This study was carried out toward the turn of the century 
at various centers of research and particularly at the Imperial Physico- 
Technical Institute (“Physikalische-Technische Reichsanstalt”), found¬ 
ed in 1884 in Berlin, Charlottenburg, where Lummer, Pringsheim, Hol- 
bom, Rubens, and others worked on the problem. 

Meanwhile the problem began to be tackled also by theoretical physi¬ 
cists. Having read in Wiillner’s textbook on heat 19 about John Tyndall’s 
experiments, 20 according to which the total emission of a heated platinum 
wire at 1200 °C (1473 °K) was 11.7 times that at 525 °C (798 °K) and 
realizing that (^ 3 ) 4 is approximately 11.7, Stefan 21 concluded in 1879 
that the total radiation E is proportional to T 4 . Stefan claimed that he 
found additional support for the validity of this relation in the measure¬ 
ments performed by de la Provostayes, Desains, Draper, and Ericsson. 
Today we know that Stefan’s inference, based after all on relatively mea¬ 
ger experimental evidence, was to a high degree fortuitous. A modem 
repetition of Tyndall’s experiment—Tyndall measured a radiation which 
was far from that of a black body—would yield a ratio of 18.6 22 rather 
than 11.7. Stefan’s “fourth-power law” for black-body radiation 

E=crT 4 (1.2) 

was put on firm experimental foundations only in 1897 by Paschen, 23 by 
Lummer and Pringsheim, 24 and by Mendenhall and Saunders. 25 

A theoretical proof, however, of Stefan’s law had already been pro¬ 
vided by Boltzmann in 1884. The principal idea of this proof occurred to 
Boltzmann in the course of a critical study of a publication by Bartoli 26 to 
which Wiedemann had drawn his attention. The Italian physicist de¬ 
scribed in a Gedankenexperiment a cyclic process by which, with the help 
of moving mirrors, heat in the form of radiation could be transferred from 
a cold body to a hot one; since according to the second law of thermody¬ 
namics, as formulated by Clausius, such a heat transfer requires the per¬ 
formance of work, he concluded that heat radiation exerts pressure. 
Boltzmann 27 recognized immediately the importance of these qualitative 
conclusions and attempted to obtain a quantitative elaboration in accor¬ 
dance with Maxwell’s electromagnetic theory of light. In a second paper, 
published shortly afterward, Boltzmann 28 showed, in accordance with 
Maxwell’s theory 29 and Kronig’s statistical averaging method, 30 that this 
pressure is equal to one-third of the energy density. Having thus associat¬ 
ed with the “radiative gas” two thermodynamic variables, the tempera¬ 
ture T and the pressure p , Boltzmann proved by thermodynamic reason- 
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ing that u = aT 4 and consequently E = aT 4 . The modem derivation of 
Stefan’s law subjects the radiation gas to a Carnot cycle or, equivalently, 
computes the thermal energy dQ necessary for an isothermal expansion 
and integrates the integrability conditions of the total differential dQ/T. 
This is but a slight modification of a second proof which Boltzmann sup¬ 
plied at the end of his paper. The exact determination of the constant a or, 
equivalently, of the “Stefan constant’’ a = ac /4 was the object of numer¬ 
ous measurements at the turn of the century. The value of a> averaged over 
the results obtained by Kurlbaum, Fery, Bauer and Moulin, Valentiner, 
Fery and Drecq, Shakespeare, and Gerlach, was found to be 
a= 5.672X 10 -5 erg/(sec)(cm 2 )(deg 4 ). 

Neither Stefan’s law nor Boltzmann’s derivation of it paid any atten¬ 
tion to the spectral distribution of the radiation. Boltzmann’s thought 
experiment which involved the reflection of radiation from a moving pis¬ 
ton clearly implied a redistribution with respect to frequency in accor¬ 
dance with the Doppler effect. In order to calculate the change of u v or B v 
due to the Doppler shift, Wien 31 studied the adiabatic contraction of a 
perfectly reflecting sphere and showed that B v must satisfy the equation 
B v — v*F{ v/T), where F( v/T) is an as yet unknown function of the argu¬ 
ment v/T. This equation or its equivalent form 32 

E^=A~ 5 <p(AT) (1-3) 

if referred not to the frequency v but, as is usually done in experimental 
physics, to the wavelength X, was called “Wien’s displacement law.” 33 
Among its numerous experimental confirmations one of the most pains¬ 
taking was that by Paschen. 34 Wien’s displacement law was rightfully 
regarded as an important achievement, for it showed that from the spec¬ 
tral distribution of black-body radiation for any given temperature the 
distribution for any other temperature could be deduced. Another imme¬ 
diate conclusion from the displacement law was the relation 

4na *T=b (1.4) 

stating that the wavelength X mSLX at which E A has its maximum value is 
inversely proportional to the absolute temperature T. Lummer and Pring- 
sheim confirmed 35 this relation and found for the constant b the numerical 
value 0.294 cm deg. The problem of explaining the empirical distribution 
curves of E x or, equivalently, of B v or u v seemed almost to have found its 
solution. It was reduced to the explanation of a single function <p(AT). It 
was clearly understood, however, that—contrary to the numerous char¬ 
acteristic physical constants and functions encountered in thermodynam¬ 
ics, electrodynamics, etc., whose particular values seemed merely to ex¬ 
press more or less accidental properties of matter—the function <p(XT) 
must play a much more fundamental role. In view of the universal charac- 
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ter of black-body radiation, its independence of the particular properties 
of the material substances, and thus, ultimately, in view of the general 
validity of KirchhofF ’s law and the second law of thermodynamics, a 
theoretical explanation of this function was rightfully regarded as a prob¬ 
lem involving fundamental issues of far-reaching importance. In fact, as 
we shall see in the sequel, the inability of nineteenth-century physics to 
account for the experimentally established function cp(AT) signalized the 
breakdown of classical mechanics. For the time being, however, desperate 
attempts were made to explain the empirical data. 

In his search for a theoretical derivation of this function Wien was 
convinced that all previous attempts 36 to solve this problem not only were 
unsuccessful in attaining their aim but attacked the problem from a wrong 
point of departure. The only exception, in his opinion, was in this respect 
Michelson’s “Theoretical essay on the distribution of energy in the spectra 
of solids,” 37 which explained the continuity of these spectra on the basis of 
the irregularity in the vibrations of the atoms. As a consequence of this 
assumption Michelson based his explanation on statistical considerations 
and employed the theory of probability. It was, of course, the classical 
conception of probability as used in statistical mechanics. In fact, Michel¬ 
son assumed that Maxwell’s classic formula for the distribution of veloc¬ 
ities of gas molecules holds also for the molecules of the solid radiating 
black body. 

In Wien’s view Michelson’s approach was a “fortunate idea.” 38 Fol¬ 
lowing these suggestions, Wien assumed that the wavelength A of the 
radiation emitted by any molecule and the corresponding intensity are 
functions of the molecule’s velocity v alone and that, conversely, v 2 is a 
function of A. From the Maxwell-Boltzmann distribution function 
v 2 exp( — v 2 /aT) dv, which determines the number of molecules whose 
velocity lies between v and v + dv , where a is a constant for a given gas, he 
inferred that E k =g(A)exp[ — f(A)/T], where g(A) and f(A) denote 
unknown functions. Since, however, this expression for E k had to be con¬ 
sistent with his previous result (1.3), Wien concluded that 39 

E x = c,A _5 exp( - or u v = av 3 exp( - (1.5) 

where c u c 2 , a , and 0 are constants. 


1.2 The Concept of Quanta of Energy 

Although based on rather questionable arguments, Wien’s law of radi¬ 
ation, as the preceding formula (1.5) was called, seemed to give an ade¬ 
quate account of all experimental data available at that time. Paschen, 40 in 
a series of painstaking measurements, and Wanner, 41 who used for this 
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purpose a photometric method, confirmed the law, at least for the visible 
region, for temperatures up to 4000 °C. Furthermore, a more rigorous 
derivation of Wien’s law of radiation was proposed by Max Planck in a 
series of papers 42 presented to the Berlin Academy of Sciences during the 
years 1897 to 1899. 

Planck, it is known, attended KirchhofF’s lectures in Berlin but de¬ 
rived his main inspiration from Clausius’ writing on thermodynamics, a 
subject which early became his field of specialization. The measurements 
performed by Lummer and Pringsheim at the Imperial Physico-Techni- 
cal Institute in Berlin attracted his attention to the problem of black-body 
radiation. This problem, as he later admitted in his autobiography, 43 had 
for him a special fascination for, in view of its universal character, it 
represented something absolute and he “had always regarded the search 
for the absolute as the loftiest goal of all scientific activity.’’ 

Taking advantage of the fact that according to KirchhofF the radiation 
distribution at equilibrium is independent of the nature of the radiators, 
Planck started with the simplest possible assumption that the radiators 
are linear harmonic oscillators of frequency v. The theory of electric- 
dipole radiation had been well known since 1889, when Hertz published 
his classic paper 44 on this subject. 

Planck most probably based his program of research on the following 
considerations. Wien’s derivation of his radiation law, in spite of its unten¬ 
able theoretical foundations, seemed to produce a correct result. Planck 
therefore assumed that Wien’s approach was probably not altogether 
wrong. The basic element in Wien’s derivation was the Maxwell-Boltz- 
mann velocity distribution, which, according to the kinetic theory of gas¬ 
es, characterizes the equilibrium reached by irreversible processes from 
arbitrary initial conditions. 

Now, since cavity radiation is a process related to electromagnetism 
rather than to the kinetic theory of gases, Planck decided that he could do 
the same for Maxwell’s theory of the electromagnetic field as Boltzmann 
had done for mechanics with his famous, though still disputed, H 
theorem. Planck thus attempted to prove that the Maxwell-Hertz equa¬ 
tions, if applied to the resonators with arbitrary initial conditions, would 
lead to irreversible processes converging toward a stationary state whose 
energy distribution is that of cavity radiation and which therefore deter¬ 
mines the energy spectrum of black-body radiation. In short, what Planck 
seems to have had in mind was a translation of the reasoning that led to the 
Maxwell-Boltzmann velocity distribution on the basis of the kinetic theo¬ 
ry into the conceptual structure of electromagnetic theory. 

In his study 45 of the absorption and emission of electrical waves by 
resonance he thought he had found such an irreversible process in the 
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interaction between absorbing and emitting resonators . 46 When this result 
was disproved by Boltzmann , 47 however, Planck proposed—in analogy to 
Boltzmann’s statistical hypothesis of the so-called “molecular chaos” in 
the kinetic theory—a theory of “natural radiation” according to which 
the harmonic partial vibrations composing a wave of thermal radiation 
are completely incoherent . 48 Equating the emission and absorption rates 
of the resonators, Planck 49 obtained as equilibrium conditions, using only 
principles of classical electrodynamics, the equation 

u v =av z U (1.6) 

where U= U — U(v,T) denotes the average energy of a harmonic oscilla¬ 
tor at temperature 7, a a constant ( 87 r/c 3 ), and u v the energy density as 
defined previously. U could, of course, be easily determined from the 
equipartition theorem of statistical mechanics according to which, loosely 
speaking, the total kinetic energy, under equilibrium conditions, is equally 
divided among the degrees of freedom, or more rigorously stated, the 
same amount of energy (\kT) has to be allotted, on the average, to every 
Hamiltonian coordinate (q or p) in terms of which the Hamiltonian can 
be expressed as a quadratic function. 

In view of the precarious status of the equipartition theorem within the 
subsequent development of quantum physics, it seems appropriate to in¬ 
clude a brief digression on the doctrine of energy equipartition. 

The earliest enunciation of the equipartition theorem, though only 
with reference to translational motion and based on hardly defendable 
arguments, was in a memoir “On the physics of media that are composed 
of free and perfectly elastic molecules in a state of motion” by J. J. Water- 
ston. In this paper, which was later called “a foundation-stone of a new 
branch of scientific knowledge ” 50 since it anticipated a great deal of the 
work of Kronig, Clausius, and Maxwell in the kinetic theory of gases, 
Waterston contended that “in mixed media the mean square molecular 
velocity is inversely proportional to the specific weight of the molecules” 
(the last six words meaning “the mass of the molecules”). A former stu¬ 
dent of John Leslie and since 1839 a naval instructor at the East India 
Company in Bombay, Waterston sent this paper to the Royal Society in 
1845. The Society, however, rejected its publication after one of the two 
referees described it as “nothing but nonsense, unfit even for reading be¬ 
fore the Society.” In accordance with the rules of the Society the manu¬ 
script became its property and was buried in the archives. Only a very brief 
report appeared in the Proceedings . 51 Convinced of the importance of his 
paper, Waterston circulated an abstract privately . 52 Five years later, at the 
21st meeting of the British Association, in Ipswich, a short extract 53 was 
read in which it was stated that “equilibrium of pressure and heat between 
two gases takes place when the number of atoms in unity of volume is 
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equal, and the vis viva of each atom is equal.” But, again, nobody took 
notice of the statement or its potential implications. “It is probable that in 
the long and honourable history of the Royal Society no mistake more 
disastrous in its actual consequences for the progress of science and the 
reputation of British science than the rejection of Waterston’s papers was 
ever made.... There is every reason for believing that had the papers been 
published physical chemistry and thermodynamics would have developed 
mainly in this country and along much simpler, more correct, and more 
intelligible lines than those of their actual development.” 54 More than 
forty years had to pass until the 1845 memoir, under its original title, was 
finally published in the Philosophical Transactions. 55 With reference to 
Waterston’s enunciation of the equipartition theorem, Lord Rayleigh, 
through whose efforts the paper was published, declared in the introduc¬ 
tion: “The omission to publish it at the time was a misfortune which 
probably retarded the development of the subject by ten or fifteen years. It 
is singular that Waterston appears to have advanced no claim for subse¬ 
quent publication, whether in the Transactions of the Society, or through 
some other channel. At any time since 1860 reference would naturally 
have been made to Maxwell, and it cannot be doubted that he would have 
at once recommended that everything possible should be done to atone for 
the original failure of appreciation.” 

Rayleigh’s reference to Maxwell alludes to his paper “Illustrations of 
the dynamical theory of gases,” 56 in which Maxwell elaborated some con¬ 
clusions submitted one year earlier at the Aberdeen meeting of the British 
Association. 57 There he gave his first formulation of the equipartition 
theorem as follows: “Two different sets of particles will distribute their 
velocities, so that their vires vivae will be equal.” At first he considered 
only the case of “smooth spherical particles” but later, in a corollary, 
extended the theorem to the case of a mixture of particles of any form and 
included rotation. 

In 1868 the theorem was further generalized by Boltzmann, who 
proved 58 its validity also for particles which are not necessarily rigid but 
have a number of internal degrees of freedom. Finally, Maxwell 59 re¬ 
moved certain restrictions on the interaction among particles and showed, 
using generalized Lagrangian coordinates for systems with an arbitrary 
number of degrees of freedom, that the equipartition of energy holds even 
if “the material points may act on each other at all distances, and accord¬ 
ing to any law which is consistent with the conservation of energy...the 
only assumption which is necessary for the direct proof is that the system, 
if left to itself in its actual state of motion, will, sooner or later, pass 
through every phase which is consistent with the equation of energy.” 

Yet the very generalizations of the theorem jeopardized its validity. 
For, as Tait 60 put it in his critical examination of Boltzmann’s approach 
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and in his search for an unassailable proof of the theorem: “There can be 
no doubt that each individual particle of a gas has a very great number of 
degrees of freedom besides the six which it would have if it were rigid; the 
examination of its spectrum while incandescent proves this at once. But if 
all these degrees of freedom are to share the whole energy (on the aver¬ 
age) equally among them, the results of theory will no longer be consistent 
with our experimental knowledge of the two specific heats of a gas, and the 
relations between them.” A still more drastic description of the problems 
raised by the theorem was given by Lord Kelvin 61 in a lecture at the Royal 
Institution on April 27,1900, when he said: “The beauty and clearness of 
the dynamical theory, which asserts heat and light to be modes of motion, 
is at present obscured by two clouds. The first...involved the question, 
How could the earth move through an elastic solid, such as essentially is 
the luminiferous ether? The second is the Maxwell-Boltzmann doctrine of 
partition of energy.” 

It should be obvious from these remarks that the scientific literature at 
the end of the nineteenth century, both in England and on the continent, 
contained numerous articles dealing with the doctrine of energy equiparti- 
tion, and there cannot be any doubt that Planck must have had knowledge 
of the equipartition theorem. 

However, fortunately for the future development of physics, Planck 
did not make use of the theorem. It is hard to say whether it was in view of 
these difficulties or because of his unfamiliarity with the Boltzmann- 
Gibbs methods of statistical mechanics 62 or his profound aversion to the 
molecular approach 63 or, finally, because of his strong conviction in the 
power of thermodynamic reasoning based on the concept of entropy. One 
thing is certain: had he used the equipartition theorem at this stage of his 
work, he would necessarily have arrived at the Rayleigh-Jeans law of 
radiation, which is incompatible with experience, and would probably 
have given up further research on this problem. Instead, adopting what he 
later called the “thermodynamic approach,” Planck 64 defined the entropy 
S of an oscillator by the equation 



(1.7) 


where a and b are constants and e is the base of natural logarithms. The 
quantity d 2 S /dU 2 , which proved important in connection with the princi¬ 
ple of the increase of entropy, thus satisfied the equation 


d 2 S _ const 
dU 2 ~ U 


( 1 . 8 ) 


The reasoning which led Planck to this apparently arbitrary definition 
of S in terms of U and v may be reconstructed on the basis of certain 
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remarks found at the end of his fifth communication . 65 From (1.6) and 
guided by the form of (1.5), Planck obtained U = Cvexp( — fiv/T), 
where C is a constant. Solving this equation for T ~ *, which according to 
thermodynamics equals, at constant volume, dS/dU , Planck obtained 
(1.7) by simple integration. Consistent with his assumption concerning 
the irreversibility associated with “natural radiation,” he then showed 
that the “total electric entropy S t = 25 + fsdr, where the summation 
extends over all oscillators and the integration over all volume elements 
dr of the radiation field with entropy density s , is a function of state which 
increases in time and reaches a maximum at equilibrium. Planck now 
assumed that a small amount of energy passes from one oscillator of fre¬ 
quency v, entropy 5, and energy U, to another of frequency v', entropy 5', 
and energy £/'. The entropy and energy principles require that <55, 
= <55 + <55' = 0 and SU + SU ' = 0, which in view of (1.7) lead to the 
equation — (av ) _1 log (U/bv) = — (av') -1 log (U'/bv'). Hence the 
expression on the left-hand side of this equation is a constant for all oscil¬ 
lators considered and therefore, in virtue of ( 1 . 6 ), a common parameter of 
u v for all v. Setting this expression—which has just been shown to be a 
function only of T —equal to T~ l , Planck obtained U = bv exp( — av/ 
T), which, in combination with (1.6), yielded Wien’s radiation law (1.5). 
Being fully aware that the result was determined by the particular choice 
of (1.7), Planck contended that an equation only of the form of (1.5) does 
lead to an expression for 5 which satisfies the entropy principle. 

Before the turn of the century, however, the unrestricted validity of 
Wien’s radiation law (1.5) was seriously challenged. Lummer and Pring- 
sheim 66 recorded systematic deviations for smaller frequencies; when ad¬ 
ditional measurements 67 in the range from 12 to 18 // confirmed their 
suspicion, they had the courage to declare: “It has been demonstrated that 
black-body radiation is not represented, in the range of wavelengths mea¬ 
sured by us, by the Wien-Planck spectral equation .” 68 In addition to these 
objections based on experiment also Planck’s theoretical procedure in 
deriving equation (1.5) became the target of severe criticisms . 69 No won¬ 
der that Thiesen , 70 Lummer and Jahnke , 71 and Lummer and Pring- 
sheim 72 proposed new distribution laws to fit also the experimental data 
obtained for longer waves. 

Meanwhile Lord Rayleigh, in a two-page paper “Remarks upon the 
law of complete radiation ,” 73 published in June, 1900, showed that the 
equipartition theorem of statistical mechanics, if applied to the electro¬ 
magnetic vibrations of cavity radiation, led necessarily to a formula radi¬ 
cally different from (1.5). Rayleigh, an expert in the mathematical treat¬ 
ment of standing waves, as he had already shown in his Theory of Sound, 14 
computed the number N x of modes of free electromagnetic vibrations per 
unit volume in an enclosure and per unit range of wavelength at A and 
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found 75 —if we consider Jeans’s subsequent correction 76 of Rayleigh’s re¬ 
sult—that N a is equal to 87 t/A 4 . Assuming that the average energy of each 
mode at temperature T t according to the equipartition theorem, is (R / 
N) T ' where R is the universal gas constant and N Avogadro’s number, or 
kT, where k is Boltzmann’s constant, introduced at that time by Planck, 
Lord Rayleigh obtained for the energy density per unit interval of wave¬ 
length u x = SirkT/A 4 or equivalently 


u v 


%WkT 

(? 


(1.9) 


This formula, the “Rayleigh-Jeans radiation law,” agreed, of course, with 
Wien’s displacement law (1.3). It also agreed with all experimental data 
in the region of extremely low frequencies, just where Wien’s radiation 
law failed. On the other hand, it was immediately clear that (1.9) must be 
wrong for high frequencies. It assigned no maximum to u v or B v , contrary 
to experience, and led—in view of the unlimited increase for higher fre¬ 
quencies—to a divergent integral for the total energy density w, a situation 
which was later, following Ehrenfest , 77 referred to as the “ultraviolet ca¬ 
tastrophe.” 

However, that for low frequencies and high temperatures u was pro¬ 
portional to T, as required by (1.9) in contrast to (1.5), had meanwhile 
been established irrefutably in a series of measurements carried out by 
Rubens and Kurlbaum . 78 The importance of these measurements for the 
future development of quantum theory is best characterized by Planck 
himself, who admitted that “without the intervention of Rubens the for¬ 
mulation of the radiation law and consequently the foundation of quan¬ 
tum theory would have perhaps taken place in a totally different manner 
and perhaps even not at all in Germany .” 79 A few days before presenting 
their results to the Berlin Academy , 80 which was to convene on October 
25, 1900, Rubens and Kurlbaum reported their observations to Planck. 
Convinced by their report of the inadequacy of Wien’s radiation law, 
Planck realized that his reasoning leading to the Wien formula had to be 
revised so that it would lead to a new formula which for large v and small 
T agrees with Wien’s expression but for small v and large T reduces to a 
proportionality of u v with T. Planck’s point of departure was, of course, 
definition (1.7). Since under the latter conditions with w v , according to 
(1.6), also U has to be proportional to T and since dS/dU = T ~ *, Planck 
inferred that S is proportional to log U or 
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whereas for the former conditions (1.8) has to remain valid. Compromis¬ 
ing, therefore, between (1.8) and (1.10), Planck assumed 81 


d 2 S _ a 
dU 2 U(U+b) 


( 1 . 11 ) 


which, in fact, reduces for small values oiU to (1.8) and hence to Wien’s 
law, and for large values of U to (1.10) and hence to the Rubens-Kurl- 
baum results. 


This interpolation, though mathematically a mere trifle, was one of the 
most significant and momentous contributions ever made in the history of 
physics. Not only did it lead Planck, in his search for its logical corrobora¬ 
tion, to the proposal of his elementary quantum of action and thus initiat¬ 
ed the early development of quantum theory, as we shall see presently; it 
also contained certain implications which, once recognized by Einstein, 
affected decisively the very foundations of physics as well as their episte¬ 
mological presuppositions. Never in the history of physics was there such 
an inconspicuous mathematical interpolation with such far-reaching 
physical and philosophical consequences. 

Now, from this interpolation, Eq. (1.11), Planck deduced that 

1 dS , i U+b 
— =- = a log- 

T dU U 


or 



b 

exp(l /a'D - 1 


where a! — — a/b and b are, of course, still functions of v. To find their 
dependence on v, Planck referred to (1.5) and (1.6) and obtained 

U=v<$> (1.12) 

where 4>(v/T) is a function of v/T. Hence, he concluded, 


and finally 


or 


const v 

exp (c r v/T) — 1 
CA~ 5 

exp(c/AT) — 1 


_ Av 3 
exp (By/T) — 1 

where c\ C, A , and B are constants. 


(1.13) 
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Planck obtained this result just in time to prepare an extended “com¬ 
ment” (“Diskussionsbemerkung”) to follow Kurlbaum’s report to the 
German Physical Society, which met on October 19, 1900. 

In this “comment,” published under the title “On an improvement of 
Wien’s radiation law,” 82 Planck announced formula (1.13), the formula¬ 
tion of what was later called “Planck’s law of radiation.” For the time 
being, it was an empirical formula since its basic assumption (1.11) had 
no rigorous theoretical justification. But it seemed to be a correct formula. 
Rubens, 83 who through the night following the Academy session checked 
it against his experimental results, reported complete agreement, as did 
Lummer and Pringsheim after correcting their own errors of calculation a 
short time afterward. 

To change the status of (1.11) from that of a “lucky guess” (“eine 
gliicklich erratene Interpolationsformel”) 84 to that of a “statement of real 
physical significance,” Planck ultimately found it necessary to abandon 
his “thermodynamic approach” and to turn to Boltzmann’s probabilistic 
conception of entropy. 85 Writing S N for the entropy of a system of N 
oscillators of frequency v, Planck, apparently following Boltzmann, posit¬ 
ed S N = k log W , where W '\s the number of distributions compatible with 
the energy of the system. In order to determine W f Planck had to assume 
that the total energy U N = NU consists of an integral number P of “energy 
elements” e (“Energie-elemente”) so that U N = Pe , for the traditional 
conception of U N as a continuous magnitude would not have admitted a 
combinatorial procedure for the determination of W. Since it is exactly at 
this point that the methodological requirement for a combinatorial proce¬ 
dure motivated Planck’s introduction of the quantum of action which, in 
its turn, led eventually to the development of quantum theory and its 
departure from the principles of classical physics, it is appropriate to 
quote Planck’s first explicit reference to h : “Now we have to consider the 
distribution of the energy U N among the N resonators of frequency v. If 
U N were regarded as an infinitely divisible quantity, the distribution could 
be performed in an infinite number of ways. We consider, however—and 
this is the cardinal point of the whole computation— U N as composed of a 
finite number of discrete equal parts and employ for this purpose the 
natural constant A = 6.55x10“ 27 erg sec. This constant multiplied by the 
common frequency v of the resonators gives the energy element e in ergs, 
and by dividing U N by e we obtain the number P of energy elements which 
are distributed among the A resonators.” 86 

Interpreting Win the equation S N = k log W as the number of possible 
ways of distribution P energy elements e among N oscillators, Planck 87 
obtained 

(N+P- 1)1 


(N— 1 )\P\ 
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and by the use of Stirling’s formula 

(N + P)» +p 
N N P p 


so that 


S N = k[(N + P)log(N + P) — NlogN — P\ogP] 

or finally 




S„ = kN fl+-^Wl+^) 



(1.14) 


Since the entropy S = S n /N of a single oscillator did indeed satisfy Eq. 
(1.11), Planck felt sure he was on the right track. From dS /dU = \/T he 
now obtained for the average energy U of the oscillators of frequency v 


U =--- (1.15) 

exp (e/kT) — 1 

which is compatible with his previous result U = v<I>( v/T) only if e = Av, 
where A is a constant independent of v. Finally, in view of Eq. (1.6) 
Planck arrived at his famous radiation law 


Sttv 2 Av 

c 3 exp(hv/fcT) — 1 


(1.16) 


in agreement with Eq. (1.13). Integrating Eq. (1.16) over all frequencies, 
Planck obtained the Stefan-Boltzmann law and established a relation be¬ 
tween k 4 /A 3 and a\ calculating the frequency at which u v reaches a maxi¬ 
mum, he confirmed Eq. (1.4) and related h/k to b. From the known 
values of a and b Planck computed the numerical value of the constant of 
action and found A = 6.55xl0 -27 erg sec. In addition he computed 
&(1.346x 10“ 16 erg deg -1 ) and, with the help of the gas constant R , 
Avogadro’s number (6.175 X10 23 mole -1 ). Finally, from Faraday’s con¬ 
stant he determined the elementary unit charge e(4.69x 10 -10 esu). 

These results were obtained within a period of about eight weeks which 
Planck described two decades later: “After a few weeks of the most strenu¬ 
ous work of my life, the darkness lifted and an unexpected vista began to 
appear.” 88 At the meeting of the German Physical Society on December 
14, 1900, a date which is often regarded as the “birthday of quantum 
theory,” 89 Planck read his historic paper “On the theory of the energy 
distribution law of the normal spectrum” 90 in which he presented these 
results and introduced the “universal constant A,” destined to change the 
course of theoretical physics. 

It should be noted that Planck’s combinatorial approach differed from 
Boltzmann’s probabilistic method 91 in so far as Planck associated W with 
S N at the equilibrium state without maximizing it. For Planck W was 
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merely the total number of possible complexions and not, as for Boltz¬ 
mann, the number of possible complexions corresponding to the macro 
state which can be realized by the largest number of complexions. The 
reason for this deviation was probably the fact, as already pointed out by 
Rosenfeld, 92 that Planck’s actual point of departure was the expression 
(1.14) for S, to fit conjecture (1.11), and that therefore W, to satisfy the 
equation S N = k log W, necessarily had to be of the form 
(N + P) N+ p /N N P p , which, because of its similarity to the well-known 
combinatorial formula 93 (A + P — 1 )\/P\(N — 1)1, prompted him to 
adopt the combinatorial procedure the way he did. 

It is also interesting to note that nowhere in this paper, nor in any other 
of his early writings, did Planck bring into prominence the fundamental 
fact that U is an integral multiple of hv. At that time Planck apparently 
was not yet quite sure whether his introduction of h was merely a math¬ 
ematical device or whether it expressed a fundamental innovation of pro¬ 
found physical significance. In an unpublished letter 94 (1931), addressed 
to R. W. Wood, Planck described in detail the psychological motives 
which led him to the postulate of energy quanta: he called it “an act of 
desperation,” done because “a theoretical explanation had to be supplied 
at all cost, whatever the price.” As he admitted later in his Autobio¬ 
graphy, 95 he was dissatisfied with his own approach and attempted repeat¬ 
edly, though unsuccessfully, to fit the introduction of h somehow (“ir- 
gendwie”) into the framework of classical physics. On the other hand, his 
son reported how his father, on long walks through the Grunewald, a 
forest in the suburbs of Berlin, intimated to him his feeling of having made 
a discovery comparable perhaps only to the discoveries of Newton. 96 

For the time being, at least until 1905, nobody in fact seems to have 
realized that Planck’s was indeed “a discovery comparable perhaps only 
to the discoveries of Newton.” Germany’s official Physical Abstracts of 
that time, the Fortschritte derPhysik , edited and published by the German 
Physical Society, mentioned Planck’s contribution only in outline (“in 
aussersten Umrissen”) 91 Outside Germany it seems to have attracted still 
less attention. An exception was Arthur L. Day’s report on Planck’s work 
to the 547th meeting of the Philosophical Society of Washington 98 in 
1902. J. H. Jean’s first edition of his Dynamical Theory of Gases, 99 pub¬ 
lished in 1904, contained no reference whatever to Planck’s law. In short, 
Planck’s introduction of h seems to have been regarded at that time as an 
expedient methodological device of no deeper physical significance, al¬ 
though his radiation law was repeatedly subjected to experimental test. It 
was confirmed by Holbom and Valentiner, 100 by Coblentz, 101 and by 
Warburg and his collaborators. 102 On the other hand, as late as 1919 
Nemst and Wulf 103 thought they had found deviations from Planck’s law. 
Subsequent research, however, fully vindicated his result from both the 
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experimental and the theoretical points of view. 104 With the increasing 
number of experimental confirmations of Planck’s law, numerous at¬ 
tempts were made to evade Rayleigh’s conclusion (1.9) without abandon¬ 
ing classical statistical mechanics and, in particular, the equipartition 
theorem. 105 The reason, as Lorentz put it, was undoubtedly that “we 
cannot say that the mechanism of the phenomena has been unveiled [by 
Planck’s theory], and it must be admitted that it is difficult to see the 
reason for this partition of energy by finite portions, which are not even 
equal to each other, but vary from one resonator to the other.” 106 

Another conceptual difficulty, which prevented the general acceptance 
of Planck’s introduction of A, was undoubtedly the following fact. As 
shown by its dimension, this quantity represented an invariable unit of 
“action” (energyXtime) or an “elementary quantum of action” (“ele- 
mentares Wirkungsquantum”), as it was subsequently called. But it was 
clear that no principle of conservation of action exists in physics. It is 
therefore not surprising that the attempt to reconcile Planck’s law with 
classical statistical mechanics was not abandoned even after Lorentz had 
shown that classical physics, that is, the equipartition theorem and Ham¬ 
ilton’s principle, leads necessarily to Rayleigh’s radiation law and its em¬ 
pirically untenable implications. As Lorentz put it, the ether is a system of 
infinitely many degrees of freedom, and the temperature of a ponderable 
body, in thermal equilibrium with it, on this assumption must necessarily 
be absolute zero, a result contrary to experience. 

Lorentz made these statements in a series of lectures which he deliv¬ 
ered in 1910 at the University of Gottingen. In this context the following 
historical comments are not without interest. 

In 1908 the mathematician Paul Wolfskehl 107 of Darmstadt be¬ 
queathed the sum of 100,000 marks to the Academy of Sciences in Gottin¬ 
gen as an award for the first person to publish a complete proof of Fer¬ 
mat’s famous Last Theorem (1637). In this theorem, it will be recalled, 
Fermat denied the existence of integers jc, y, z, and n which satisfy xyz ^ 0, 
n > 2, x n + y n = z”. It is also well known that the theorem has not yet been 
proved but has gained the unique distinction of being the problem for 
which the greatest number of incorrect “proofs” has ever been published. 

What is not so well known, however, is the wise decision of the Wolfs¬ 
kehl committee to use the interest of the amount for the purpose of invit¬ 
ing prominent scientists as guest speakers to Gottingen. Such an invita¬ 
tion brought Poincare there at the end of April, 1909. He gave six lectures 
on problems in pure and applied mathematics. 108 In his first talk (April 
22) he spoke of Fredholm’s equations in connection with the work of Hill 
and Helge von Koch, a subject whose relevance to quantum theory was 
not recognized until 1925; in his last lecture “La Mecanique Nouvelle” 
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(April 28)—the only one he gave in French—he discussed the theory of 
relativity—incidentally, without mentioning the name of Einstein. 

In the following year Lorentz was invited. From October 24 to 29, 
1910, he delivered six lectures on “Old and New Problems in Physics” 109 
which were subsequently edited by Bom and published in the Physika- 
lische Zeitschrift . The last three of these lectures dealt with the problem of 
black-body radiation. Three years later Sommerfeld spoke on problems of 
mathematical physics, and in the summer semester of 1914 Debye gave a 
series of lectures. 

The last scientist to be invited on this program was Niels Bohr. His 
Gottingen lectures, delivered on June 12 to 22,1922, had, as we shall see in 
due course, a decisive influence upon Pauli and Heisenberg. Bohr’s “Sev¬ 
en Lectures on the Theory of Atomic Structure” 110 began with a general 
survey of atomic theory (first lecture), dealt with the correspondence 
principle and the adiabatic principle (second lecture), their applications 
(third lecture), discussed polyelectronic systems (fourth lecture), the 
periodic system (fifth lecture), x-rays and atomic structure (sixth lec¬ 
ture), and concluded with remarks on problems still to be solved. The 
subjects covered in these lectures were essentially the same as contained in 
Bohr’s paper on the structure of atoms, published at that time. 111 

Mathematical research so far seems to have profited very little from 
Wolfskehl’s incitement, and since the inflation in Germany depreciated 
the prize and in view of the historic impact of Bohr’s lectures upon Pauli 
and Heisenberg it is perhaps no exaggeration to say that quantum theory 
was the main beneficiary of the Wolfskehl Prize. Whether this statement 
will have to be modified in view of the recently proposed revival of the 
prize remains to be seen. 

In concluding our discussion on the early development of the concep¬ 
tion of energy quanta, in which Planck’s derivation of the radiation law 
played the dominant role, we think it necessary to stress the following 
critical remarks. 

As we have pointed out, Planck’s derivation consisted of two separate 
parts: (1) a derivation of the relation (1.6) between the radiative energy 
density u v and the oscillator energy U, 

u v =^-v’ 2 U (1.6) 

c 

a formula which Planck obtained by using exclusively the principles of 
classical electrodynamics (as shown in Appendix A); (2) a statistical 
treatment of the interaction among oscillators of different proper frequen¬ 
cies which resulted in the formula (1.15), 
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€ 

exp (e/kT) — 1 


(1.15) 


By combining (1.6) and (1.15) Planck obtained his radiation law (1.16). 
We have also emphasized that these conclusions were adduced by Planck 
in order to provide a logical justification of his far-reaching interpolation 
mentioned above. 

Planck’s reasoning was inconsistent, however, as Einstein, in 1906, was 
the first to recognize. 112 For although either part of Planck’s derivation of 
(1.16) was in itself consistent, their combination was logically incompati¬ 
ble. The reason was this: in the electrodynamical part (1) formula (1.6) is 
based on Maxwell’s theory (see Appendix A) and the assumption that the 
oscillator energy is a continuously variable quantity, whereas in the statis¬ 
tical part (2) this same energy is treated as a discrete quantity, capable of 
assuming only values which are multiples of hv. 

Referring to this inconsistency, Einstein remarked that “if the energy 
of a resonator can change only discontinuously, the usual theory of elec¬ 
tricity cannot be applied for the calculation of the average energy of such a 
resonator in a radiation field. Planck’s theory has, therefore, to assume 
that, although Maxwell’s theory of elementary resonators is not applica¬ 
ble, the average energy of such a resonator, surrounded by radiation, is 
equal to that which would result from the calculation on the basis of 
Maxwell’s theory of electricity.” 

“Such an assumption,” continued Einstein, “would be plausible pro¬ 
vided e = h v were small throughout the observable spectrum compared to 
the average energy U of the resonator; but this is not the case.” 113 

Three and a half years later, at the 81st meeting of the German Associ¬ 
ation of Scientists, held at Salzburg in September, 1909, Einstein 114 spoke 
on the development of our ideas on the nature and constitution of radi¬ 
ation. Repeating on this occasion his challenge to Planck’s reasoning and 
resuming the question of whether the two parts cannot be reconciled with 
each other, Einstein pointed out that the previously mentioned condition, 
namely, that the energy quantum e = hv be small in comparison with U, is 
certainly not satisfied. “A simple calculation shows,” he declared, “that 
e/U for v = 0.5 // and T = 1700 °K is not only not small compared to 
unity, but very large. It is approximately 6.5 X 10 7 .” 

For Einstein this inconsistency was no reason to reject Planck’s quan¬ 
tum theory as such. Having meanwhile proposed his ideas concerning 
light quanta, as we shall see in the next paragraph, Einstein saw in this 
inconsistency an indication that the foundations of the traditional radi¬ 
ation theory, based on Maxwell’s electromagnetic theory, had to be re¬ 
vised. 
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The logical incompatibility of the two parts in Planck’s derivation of 
his radiation law was a matter of great concern also for Peter Debye. 115 
But contrary to Einstein, who hoped to overcome the difficulty by modify¬ 
ing the Maxwellian interaction between resonator and field, Debye at¬ 
tempted to resolve the inconsistency by eliminating altogether the role of 
ponderable resonators in Planck’s derivation. Referring to the Jeans-Ray- 
leigh computation of the number Ndv of vibrations in an enclosure of unit 
volume and frequency interval dv , 


N 


dv = 


87TV 2 


dv 


Debye assumed that the Ndv vibrations consist of/(v) quanta of energy 
content hv each, so that 


u v dv = f(v)dv 

c 


Defining “black radiation” as the most “probable radiation,” that is, as 
the state with the greatest possible number of distributions of the/(v) 
quanta among the Ndv receptors, Debye proved by using Planck’s com¬ 
binatorial formula that in this case/(v) = [exp(kv/kT) — l] _1 ,aresult 
which in combination with the preceding formula implied Planck’s law of 
radiation. Debye thus showed that Planck’s law and its implications fol¬ 
low from the assumption alone that the energy as such is quantized in 
units of hv and no knowledge concerning the properties of resonators or 
their mechanism is needed for this purpose. Debye’s 116 assumption may 
be referred to as the “weak quantum postulate,” in contrast to Planck’s 
“quantum postulate,” according to which also the energy content of an 
oscillator is always a multiple of hv. 


1.3 The Concept of Quanta of Radiation 

In the development of quantum theory discussed so far, the concept of 
energy elements or quanta had been regarded as applicable only to the 
mechanism regulating the interaction between matter and radiation: it 
was the material oscillator of frequency v which could emit or absorb 
energy only in multiples of hv. 

Meanwhile, however, an important conceptual development took 
place which led to a certain generalization of the conception of quanta. It 
began in 1905, when the general validity of the electromagnetic theory of 
light was seriously called into question by Einstein’s article “On a heuris¬ 
tic viewpoint concerning the production and transformation of light.” 117 
In its importance for the future development of theoretical physics this 
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essay may be compared with Einstein’s classic paper on special relativity 
with which it appeared—together also with his famous study on Brown¬ 
ian motion—in the same volume of the Annalen der Physik. Although 
commonly referred to as Einstein’s paper on the photoelectric effect, it 
discussed a problem of much wider significance and contained a sugges¬ 
tion which challenged classical physics perhaps to the same extent as did 
Planck’s historic paper of 1900. 

Einstein considered monochromatic radiation of frequency v and of 
small density within the range of v/T where Wien’s radiation law (1.5) is 
valid. If v is the volume of the enclosure and u (v) the spectral distribution 
function, the entropy could be expressed by the equation S = vfocp(u,v) 
Xdv , where cp is a function of u and of v. In order to find the explicit 
dependence of cp on u and v, Einstein had two equations to start with: 
Sfcpdv = 0, expressing the fact that the entropy for the equilibrium state 
of the cavity radiation is a maximum, and 8fu dv = 0, expressing the 
conservation of energy. Introducing an undetermined multiplier, he ob¬ 
tained, for every choice of 8u as a function of v, the equation 

where A and consequently also dcp /du are independent of v. Taking v = 1, 
Einstein calculated the increase of entropy for dT as 

dS = r ^-dudv 

J v = o du 

or, in view of the independence just proved, dS= (dcp /du) dE, where dE 
is the heat added reversibly and hence subject to dS = dE /T. Comparison 
of the last two equations showed that dcp /du = \/T irrespective of the 
particular form of w(v). Solving Eq. (1.5) for T~ x , Einstein obtained the 
differential equation 


dcp 

du 

and after integration 


(/ffv) ‘log “T 
or 



The entropy of the radiation within the interval from v to v + dv and with 
the energy E v = vu v dv was therefore given by the expression 



If the radiation, originally in volume v 0 , is assumed to occupy volume v, 
the last equation shows that the change in entropy is 
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or equivalently 


S-S 0 = 



S-S 0 = 



(1.17) 


where Ais Avogadro’s number and R the gas constant. On the other hand, 
from the kinetic theory of gases, Einstein argued, it is well known, that the 
probability of finding n particles at an arbitrary instant of time within a 
partial volume v of the volume v 0 in which they were originally moving is 
given by (v/v 0 ) n . Hence 


S-So^logf-V (1.18) 

Guided by the identity of the mathematical structure of (1.17) and 
(1.18), Einstein concluded that E v = n(R/3v/N) and declared that with 
respect to the theory of heat, “monochromatic radiation of small density 
(within the range of validity of Wien’s radiation law) behaves as if it 
consisted of independent energy quanta of magnitude RfSv/N.” 118 Wien’s 
exponential coefficient /?, expressed by Planck’s constants, was of course 
h /k, as Planck’s law readily showed for /zv> kT and R /N = k. In effect, 
therefore, Einstein stated that radiation behaved as if it were composed of 
a finite number of localized energy quanta hv or “photons,’’ as they were 
later called after G. N. Lewis 119 introduced this term in 1926. 

The idea of a discontinuous distribution of radiant energy in space was, 
of course, completely at variance with the prevailing undulatory electro¬ 
magnetic theory of light. Furthermore Einstein’s suggestion of a granular 
structure of radiation seemed to counter one of the most well-founded and 
indisputable results of physical research. Was not the discovery of diffrac¬ 
tion, first reported by Leonardo da Vinci, 120 rediscovered and investigated 
by Grimaldi, 121 and accountable only in terms of the wave theory of Huy¬ 
gens 122 and Young, 123 as Fresnel 124 has so masterly shown, an incontest¬ 
able deathblow to any corpuscular conception of light such as proposed by 
Newton, 125 Laplace, 126 or Biot? 127 Have not Fizeau, 128 Foucault, 129 and 
Breguet, 130 following a suggestion by Arago, 131 established without doubt 
that the velocity of light is less in water than in air and thus brought forth a 
crucial 132 decision of the “particle-versus-wave” issue in favor 133 of the 
latter? As Hanson 134 recently pointed out, these experiments proved the 
undulatory nature of light but certainly did not prove that light cannot 
also be corpuscular. For classical physics has only gradually shaped the 
notions of particle and wave to logical contraries or opposites. In fact, a 
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thorough scholarly study of the history of the logical relationship between 
these two notions, so important for modem physics, and on their develop¬ 
ment to the status of fundamentally incompatible and mutually exclusive 
conceptions still remains a project for future research. In any case, when 
Young declared, “It is allowed on all sides, that light either consists in the 
emission of very minute particles from luminous substances, which are 
actually projected, and continue to move, with the velocity commonly 
attributed to light, or in the excitation of an undulatory motion, analogous 
to that which constitutes sound, in a highly light and elastic medium 
pervading the universe; but the judgments of philosophers of all ages have 
been much divided with respect to the preference of one or the other of 
these opinions,” 135 he obviously used the connective “or” in the disjunc¬ 
tive sense of the Latin aut (and not vel). Arago 136 even considered the 
issue as a mathematically or logically unequivocal dichotomy. Yet, for the 
physics of the later nineteenth century the spatial distribution of energy 
was either discrete, as in the corpuscular-kinetic theory of Newtonian 
mechanics, or continuous, as in Maxwell’s electromagnetic theory, but 
never both discrete and continuous for one and the same category of phys¬ 
ical phenomena. 

Strictly speaking, as Einstein 137 once pointed out, the success of the 
wave theory of light was the first breach in Newtonian physics, for corpus¬ 
cular-kinetic conceptions were replaced by field-theoretic notions. But 
throughout the later nineteenth century these two schemes of conceptions 
enjoyed a rather peaceful coexistence. 

Even Einstein, in the beginning of the paper under discussion, 138 ad¬ 
mitted that the classical theory of light based on continuous space func¬ 
tions was so firmly established that it would probably never be replaced by 
another theory. But, he continued to say, optical observations take ac¬ 
count only of time averages, and it is quite conceivable that such a theory 
of light, in spite of its convincing verifications by experiments on interfer¬ 
ence and diffraction, may prove itself insufficient whenever instantaneous 
values of those functions have to be considered or whenever interactions 
of matter with radiation, as in the processes of emission and absorption, 
are involved. Einstein, it seems, did not know that similar doubts had 
previously been raised by J. J. Thomson. Faced by difficulties in explain¬ 
ing quantitatively the ionization caused by Rontgen rays, as x-rays were 
still called at that time, Thomson declared in 1903: “If, for example, we 
consider a plane at right angles to the direction of propagation of the rays 
the energy is not distributed uniformly over this plane, but the distribution 
of energy has as it were a structure, although an exceedingly fine one, 
places where the energy is large alternating with places where it is small, 
like mortar and bricks in a wall.” 139 The effect which led Thomson to his 
conjecture of “patches of energy,” namely, photoionization, was one of 
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the instances, together with Stokes’ law and the photoelectric effect, for 
which Einstein suggested the application of his new conception of quanta 
of light. 

Owing to Einstein’s paper of 1905, it was primarily the photoelectric 
effect to which physicists referred as an irrefutable demonstration of the 
existence of photons and which thus played an important part in the 
conceptual development of quantum mechanics. It seems appropriate, 
therefore, to discuss it in greater detail. 

It will be recalled that in a series of ingenious experiments begun in 
1886 Heinrich Hertz demonstrated the existence of electromagnetic 
waves and thereby confirmed the electromagnetic theory of light in a way 
which Maxwell himself had feared would never be achieved. The same 
series of experiments, however, which so brilliantly confirmed the electro¬ 
magnetic theory of light—paradoxical as it may sound—also produced 
the first evidence toward its refutation. For it was in the course of these 
investigations that Hertz discovered as early as 1887 that the length of the 
spark induced in the secondary circuit was greatly reduced when the 
spark gap was shielded from the light of the spark in the primary circuit 
and “that ultraviolet light has the power to increase the spark gap of the 
discharge of an inductor and of related discharges.” 140 

Although this statement may rightfully be regarded as the earliest de¬ 
scription of the photoelectric effect which eventually served to disprove 
the universal validity of the electromagnetic theory of light, in the interest 
of historical accuracy, it should be remarked that Schuster 141 and Arrhen¬ 
ius, 142 in the course of their investigations of related subjects, indepen¬ 
dently of Hertz and at about the same time, called attention to essentially 
the same effect, although they were not fully aware of the immediate cause 
of the phenomenon. Hertz refrained from formulating any theory as to the 
mechanism by which radiation facilitates electrical discharge, recogniz¬ 
ing that much more experimental research was needed for such a task. In 
fact, his discovery initiated a long series of investigations. First, Wiede¬ 
mann and Ebert 143 discovered in 1888 that irradiation from an electric arc 
discharges the negative electrode without affecting the positive one. Hall- 
wachs, 144 also in 1888, found on the basis of his experiments with freshly 
polished zinc plates that a rise in temperature does not significantly in¬ 
crease the effect, red and infrared radiation being ineffective, and that 
negatively charged particles seem to be emitted. He also demonstrated 145 
that even an electrically neutral insulated plate, if irradiated, required a 
small positive potential. In the following year Stoletow showed that for a 

o 

certain metal, wavelengths above 2950 A were ineffective, and he devised 
the first cell for the production of a photoelectric current. Stoletow 
found 146 that this current is strictly proportional to the intensity of the 
light absorbed, and he demonstrated that the time lapse between the inci- 
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dence of radiation and the discharge is less than a millisecond. At the same 
time Elster and Geitel, in a long series of measurements with the help of an 
Exner electroscope, systematically examined metals and alloys for their 
photoelectric response to sunlight, to arc light, to the light of a petroleum 
lamp, and finally to that from a glass rod just heated to redness. 147 They 
found that the more electropositive a metal is, the greater is its photoelec¬ 
tric sensitivity and that rubidium, potassium, sodium, and their alloys are 
sensitive even to visible light. 

Once these experimental features were determined, research concen¬ 
trated on the identification of the carriers of the photoelectric current and 
on their physical characteristics. That it was not the molecules of the gas 
surrounding the cathode which were the carriers of this current followed 
from the fact that the effect persisted independently of the gas pressure as 
soon as a certain low pressure had been reached, and continued to persist 
at the highest vacuum attainable. In 1898 Rutherford measured 148 the 
velocity of these carriers of charge and in the same year J. J. Thomson, 149 
on the basis of his famous measurements of the velocity and of the charge- 
to-mass ratio of the electron, suggested identifying those carriers with the 
cathode rays in Geissler tubes. This suggestion found additional support 
in a series of experiments performed by Lenard 150 in which a sodium 
amalgam served as a cathode and a chemically pure platinum wire as an 
anode in an atmosphere of hydrogen while a photoelectric current carry¬ 
ing about 3 //coul was allowed to flow through the circuit. Had sodium 
atoms been the carriers of the charge, the current would have deposited 
almost 10“ 6 mg of sodium on the platinum wire, an amount sufficient to 
be detectable by spectroscopic means. But no trace of sodium was found. 
Having demonstrated that the carriers are electrons, Lenard showed that 
their emission occurs only at a frequency that exceeds a certain minimum 
value v 0 , confirming Stoletow’s result, that the energy of the ejected elec¬ 
trons increased with increasing difference v — v 0 , irrespective of the dis¬ 
tance of the source from the irradiated surface. The existence of such a 
threshold frequency and the fact, also irrefutably verified by Laden- 
burg, 151 that the energy of the photoelectrons is independent of the light 
intensity but proportional to the frequency of the incident light proved 
irreconcilable with Maxwell’s electromagnetic theory of light, but it re¬ 
ceived a most natural explanation by the hypothesis of light quanta as 
proposed by Einstein in 1905. For in view of the supposed corpuscular 
localization or concentration of their energy R/3v/N = hv, each quantum 
could be thought of as being capable of interacting in toto with one elec¬ 
tron only. 

It is well known that all quantitative features of the photoelectric effect 
could be accounted for by Einstein’s photoelectric law, according to 
which the maximum kinetic energy of the photoelectrons is given by (R / 



Formation of Quantum Conceptions 


27 


N)/3v — P or, in modem notation, by hv — P 9 where P 9 a constant for a 
given metal, is the work necessary to remove an electron from the metal. 
Einstein’s photoelectric equation was verified in 1912 by Hughes, 152 who 
measured the maximum velocity of photoelectrons ejected from a number 
of elements (K, Ca, Mg, Cd, Zn, Pb, Sb, Bi, As) which he carefully 
prepared by distillation in a vacuum, and by Richardson and Compton, 153 
who improved the measuring method by using a modem Hilger mono¬ 
chromator and making allowance for the contact-potential differences of 
the electrodes. Einstein’s remark that the stopping potential “should be a 
linear function of the frequency of the incident light, when plotted in 
cartesian coordinates, and its slope should be independent of the nature of 
the substance investigated” 154 found its most striking verification in the 
brilliant experimental work performed by Millikan at the Ryerson Labo¬ 
ratory of the University of Chicago. In his Nobel Prize Address, delivered 
in Stockholm in 1923, 155 Millikan summarized his work in these words: 
“After ten years of testing and changing and learning and sometimes 
blundering, all efforts being directed from the first toward the accurate 
experimental measurement of the energies of emission of photoelectrons, 
now as a function of the temperature, now of wavelength, now of material 
(contact E.M.F. relations), this work resulted, contrary to my own expec¬ 
tations, in the first direct experimental proof in 1914 of the exact validity, 
within narrow limits of experimental errors, of the Einstein equation, and 
the first direct photoelectric determination of Planck’s constant /z.” 

Millikan’s work consisted of a long series of painstaking measure¬ 
ments. As early as 1908, at the Boston meeting of the American Physical 
Society, he was in a position to report some striking results. 156 Essentially 
what he intended to do was measure the kinetic energy of the liberated 
electrons by a determination of the stopping potential, i.e., the potential 
just sufficient to prevent them from reaching a second metallic plate. The 
actual carrying out of these measurements, however, was complicated by 
the necessity of a precise determination of the energy that had to be sup¬ 
plied to the electrons to overcome retaining forces, such as that of the 
image charge, and others which, as functions of the nature of the surface, 
were most difficult to control and reproduce. Millikan worked with the 
alkali metals sodium, potassium, and lithium. With monochromatic 
beams of mercury light he illuminated their surfaces, which he obtained 
clean by cutting shavings in a high vacuum with the aid of a specially 
designed and magnetically operated knife. In 1916 Millikan finally pro¬ 
vided incontrovertible proof of the direct proportionality between the ki¬ 
netic energy of the photoelectrons and the frequency of the absorbed 
light. 157 He showed that the constant of proportionality is independent of 
the nature of the surface and that its numerical value is 6.57 X10 -27 erg 
sec, in close agreement with the value obtained by Planck in 1900. 158 As a 
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result of Millikan’s confirmation of Einstein’s photoelectric equation, the 
quantum of action became a physical reality, accessible directly to experi¬ 
ment, and Einstein’s conjecture of light quanta was endowed with phys¬ 
ical significance and an experimental foundation. 159 

If a particle is defined as a carrier of energy and momentum, Planck’s 
clarification of the physical content of the energy-momentum equations in 
special relativity and electromagnetic theory 160 showed that the quanta of 
light which Einstein always conceived as directed in space were indeed 
particles. For according to Planck every flux of energy is accompanied by 
a transfer of momentum. Hence each quantum of light, moving in empty 
space with the velocity c, carries with it a momentum hv/c. Stark, one of 
the earliest proponents of the hypothesis of light quanta, suggested as 
early as 1909 a possibility of experimentally measuring these momenta. In 
fact, in a paper published in 1909, Stark 161 outlined essentially the theory 
of an experiment by which Compton in 1922 strikingly confirmed Ein¬ 
stein’s hypothesis of light quanta and their transfer of momentum, as we 
shall see in due course. 

Although the full comprehension of the dual nature of light belongs to 
a later stage in the development of quantum theory, its roots may be found 
in one of Einstein’s early papers on black-body radiation. In an article “On 
the present state of the problem of radiation” 162 Einstein calculated the 
fluctuations of radiative energy in a partial volume v of an isothermal 
enclosure of temperature T. Denoting by E the instantaneous energy with¬ 
in the frequency interval between v and v + dv, with E = fEe ~ E/kT 
X dp dq/Se ~ E/kT dp dq its average energy and with e = E — E the energy 
fluctuation, he could easily show that the mean-square fluctuation 
which equals iP — E 2 , where E T 2 = SE 2 e~ E/kT dp dq/fe~ E/kT dp dq , 

is given by = kT 2 dE /dT\ a result which also could be obtained on the 
basis of the quantum-theoretic energy distribution accordingto which E 
assumed only the discrete values E n = nhv. Substituting in E — vu v dv 
for w v , in conformance with Planck’s law, the right-hand member of Eq. 
(1.16), Einstein obtained 


7 = kT 2 v dv 


du v 

~dT 


%irh 2 v 4 v dv e hv/kT 

c 3( e hv/kT_ 


and after eliminating T, again by Eq. (1.16), the equation 


= Ehv + 


c 3 E 


Siry^v dv 


(1.19) 


The second term of the sum in Eq. (1.19) is the mean-square energy 
fluctuation due to interferences between partial waves, as Einstein showed 
in his paper by a simple dimensional analysis and as Lorentz 163 subse- 
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quently demonstrated in rigorous detail. It is therefore exactly the term 
that had to be expected on the basis of the undulatory or Maxwellian 
theory of light. The first term, on the other hand, while inexplicable from 
this point of view, could easily be accounted for, as Einstein pointed out, 
on the basis of his hypothesis of light quanta, according to which the latter 
are subject to the same statistical laws as are particles or molecules in the 
kinetic theory of the ideal gas. In fact, if n denotes the average of the 
instantaneous number n of light quanta in volume v and 8 = n — n the 

fluctuation of n , then — (8hv) 2 = nh 2 v 1 — Ehv since W = n and 
E = nhv. Einstein corroborated his result with a discussion of the Brow¬ 
nian motion of a mirror which he assumed to be a perfect reflector for 
radiation within the frequency interval dv and transparent for all other 
frequencies. For the mean square of the pressure fluctuation he obtained 
again a sum of two terms, one term corresponding to the fluctuation of 
waves and the other to that of particles. Einstein, it seems, did not realize 
at that time the momentous importance of his conclusions. His point was 
primarily to show that the wave theory of light, as conceived at the time, 
was incompatible with Planck’s experimentally well-established law of 
radiation. 

However, at the conclusion of an address on “The Development of Our 
Conceptions on the Nature and Constitution of Radition,” 164 which he 
delivered before the 81st assembly of the German Association of Scien¬ 
tists, held at Salzburg in September, 1909, Einstein proposed, though only 
in outline, a model for his theory of light quanta and their statistical 
behavior. He described each quantum of light as a singularity in space 
surrounded by a field of forces whose magnitudes decrease with the dis¬ 
tance from the singularity at the center. A superposition of these fields, he 
contended, would produce an undulatory field of structure similar to that 
of the electromagnetic field in Maxwell’s theory of light. The purpose of 
this suggestion, as Einstein explicitly stated, was not to construct such a 
model but merely to show “that the two structure qualities, the undula¬ 
tory structure and the quantum structure, both of which Planck’s formula 
ascribes to radiation, are not incompatible with each other.” 165 Concern¬ 
ing the details of a model for light quanta or the details of their theory, 
Einstein pointed out that the first term of the sum in Eq. (1.19) does not 
supply sufficient information, just as Maxwell’s equations cannot be de¬ 
duced from its second term. 

Nevertheless, the search for an acceptable mechanical model to explain 
the mechanism of quanta of light and, more generally, the intimate con¬ 
nection between the corpuscular concept of energy E and the undulatory 
notion of frequency v, a relation nowhere encountered in classical physics 
and yet indisputably assured, was the subject of much discussion at that 
time. The construction of such models, it was hoped, would be particular- 
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ly useful for the clarification of the real meaning of Planck’s constant h. In 
his reminiscences of the early years of quantum theory Max Bom 166 men¬ 
tions the following example of such a model which, by the way, Planck is 
said to have used himself in one of his lectures. Consider, says Bom, an 
apple tree and assume that the length / of the stems of its apples, regarded 
as pendula, is inversely proportional to the square of their height H above 
ground, so that v~/ _1/2 ~//. If now the tree is shaken with a certain 
frequency v, the apples at the corresponding height H will resonate and 
fall to earth with a kinetic energy E proportional to H and consequently 
also proportional to v. Needless to say, models of this kind and even more 
refined macroscopic models did not contribute very much toward an un¬ 
derstanding of Planck’s constant. 

Among the microscopic models, based on the contemporary knowl¬ 
edge concerning the structure of the atom, those proposed by Schidlof and 
by Haas were widely—though not always favorably—discussed. Atomic 
theory, at that time, was still based on J. J. Thomson’s model of the atom, 
which described it as consisting of a positively charged sphere of uniform 
density inside which a compensating number of negatively charged elec¬ 
trons revolve on circular paths. Schidlof 167 modified Thomson’s model by 
assuming that at the center of the positive sphere a negatively charged 
sphere is located whose density equals that of the electrons. On the basis of 
this model Schidlof showed that Planck’s constant is given by the relation 
h = hreiam/N 113 ) 112 , where Aisthenumber of electrons, mtheirmass,e 
their charge, and a the radius of the atom. 

Of greater importance, though likewise based on J. J. Thomson’s as¬ 
sumptions, was a model of the hydrogen atom proposed in 1910 by Arthur 
Erich Haas. 168 Since it made it possible to deduce for the first time the 
Rydberg constant, though only with a wrong numerical factor, in terms of 
the elementary charge of the electron, its mass, and Planck’s constant and 
since it thus led, albeit on the basis of erroneous assumptions, to a result 
which preceded 169 Bohr’s famous derivation of this spectroscopic con¬ 
stant by three years, it is appropriate to discuss Haas’s model and the 
peculiar circumstances which led to its conception in greater detail. 

In December, 1908, Haas completed a scholarly historicocritical anal¬ 
ysis of the principle of energy conservation 170 which he submitted to the 
philosphical faculty of the University of Vienna for his habilitation as 
Privatdocent. The experimental physicists Franz Exner and Victor Lang, 
who were appointed as referees and who knew that Haas had obtained his 
Ph.D. only two years earlier, turned it down, but asked him to renew his 
application after some time with additional research of a more technical 
nature in physics. Haas complied with this request, and within almost a 
year’s time he established a theory of the hydrogen atom, which he later 
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regarded as “the greatest achievement” 171 of his life, and submitted it to 
the Viennese Academy of Science, where it was read on March 10, 1910. 

In the first part of the paper Haas showed that an electron of mass m 
and charge — e , moving along a circle of radius r, concentric with a 
uniformly distributed positive spherical charge e of radius a , attains the 
maximum of its total energy, namely, e 2 /a , if r = a. Assuming also that in 
this case the energy is given by hv ^, where v O0 (= 8.23 X10 14 sec -1 ) is 
the frequency c/A^ , A ^ being the limit of the Balmer expression 

A = ” X 3.647 X10~ 5 cm 

n — 4 

for n = oo (or, in modem notation, v a = Rc/ 4, R being the Rydberg 
constant), and that v a is the frequency of the electronic revolution, Haas 
deduced from the equality between the Coulombian attraction e 2 /a 2 and 

the centripetal force maco 2 = 4ij l mav^ 2 that v a = e/ (lira^ am). Thus, 
on the assumption that the electron revolves on the surface of Thomson’s 
positive sphere, he derived the relations 



which, after elimination of a , implied that 

47 ^me 4 
v =- 

h 3 

(or, in modem notation, R = \6i7 2 me 4 /h 3 c). Had Haas taken hv^ 
= \e 2 /a (which would have agreed with Bohr’s result derived from the 
quantization of the angular momentum), he would have obtained the 
eight times smaller correct Bohr formula R = 2 / n 2 me 4 /h 3 c. In view of the 
relatively great uncertainty in the knowledge of the size of elementary 
constants—it was only in 1911 that Millikan introduced the technique of 
oil droplets, which considerably improved the accuracy in the determina¬ 
tion of e —Haas’s result and his deductions of the values of 
e( = 3.18X 10 -10 esu), of m ( = 5.68xl0 -28 g), and of a 
( = 1.88 X 10 -8 cm) did not bring him into conflict with experience. 

Haas’s work, however, was rejected and even ridiculed. In fact, as 
reported in his unpublished autobiography, when Haas lectured on his 
ideas at a meeting of the Vienna Chemical-Physical Society, Ernst Lecher, 
the director of the Institute of Physics at the Vienna University, called it a 
joke (“ein Faschingsscherz”) and Hasenohrl, when asked for his opinion 
on this matter, remarked that Haas’s jumbling together of two completely 
unrelated subjects such as quantum theory, which was still regarded as a 
part of the theory of heat, and spectroscopy, a branch of optics, was “na¬ 
ive” and should not be taken seriously. It was therefore only in 1912, after 
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the publication of another technical paper on the equilibrium distribution 
of electron groups in the Thomson model of the atom, 172 that Haas ob¬ 
tained his appointment as Privatdocent. 

And yet, although generally rejected, Haas’s work was not completely 
ignored. Explicit reference to it and to his article in the Physikalische 
Zeitschrift 173 was made by Arnold Sommerfeld in an extremely important 
paper which Sommerfeld presented to the first Solvay Congress 174 in 1911 
and on which we shall have to comment in another context later on. 
Furthermore, Niels Bohr, who, as stated in his Rutherford Memorial 
Lecture, 175 learned of Haas’s work only after having accomplished his 
own famous derivation of the Rydberg constant in 1913, got “a vivid 
account of the discussions at the first Solvay meeting” 176 from Rutherford 
and thus probably also of Sommerfeld’s paper, which was to some extent 
influenced by Haas’s conceptions, as we shall see in due course. Could 
such an influence, circuitous and scanty as it may have been, be fully 
established between Haas and Bohr, Haas’s work, although fortuitous 
and based on untenable assumptions, would prove to have been of great 
historical importance for the conceptual development of quantum theory. 

As we have seen, Sommerfeld did not reject the idea of connecting 
Planck’s constant with atomic constants, but he objected to the philoso¬ 
phy on the basis of which such attempts were carried out. He opposed the 
idea that a dynamic interpretation of h had to be found. In an address to 
the 83d meeting of the German Association of Scientists, held in 1911 in 
Karlsruhe, he declared that he thought it futile to explain h on the basis of 
molecular dimensions; the existence of molecules should rather be educed 
as a “function and consequence of the existence of an elementary quantum 
of action.... An electromagnetic or mechanical ‘explanation’ of A,” he stat¬ 
ed, “seems to me just as worthless and unpromising as a mechanical ‘ex¬ 
planation’ of Maxwell’s equations. It would be much more profitable to 
science if the numerous consequences of the h hypothesis were investigat¬ 
ed and other phenomena were retraced to it.” 177 Modem quantum me¬ 
chanics, which views Planck’s constant A as a kind of connecting link 
between the concept of waves and that of particles, fully justified Sommer¬ 
feld’s contention. Something which delimits and restricts the validity of 
models based either on the particle conception or on the undulatory con¬ 
ception cannot itself be represented by a model and certainly cannot be 
explained in the terms of classical physics. 

This argument, however, had not yet been proposed in the early years 
of the century. In fact, it was that very belief in the unrestricted validity of 
a model that obstructed the general acceptance of Einstein’s idea of quan¬ 
ta of light. For in spite of the force of its argument and the cogent reason¬ 
ing behind its statistical conclusions, Einstein’s hypothesis of localized 
quanta of light was still regarded as unacceptable. Even Lorentz rejected 
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these ideas. Experiments had been performed by Lummer and Gehrcke 178 
in which two rays of the green mercury line with a phase difference of 
about two million wavelengths were shown to be still capable of being 
brought into interference, and these experiments clearly indicated that the 
spatial extension of quanta of light in the direction of their propagation, if 
such quanta existed as presumably incoherent entities, was at least a me¬ 
ter. Similarly, the improvement in resolution due to the increase of large- 
scale apertures of telescopes, for example, was thought to be inexplicable 
unless the lateral dimensions of these quanta were of the same order of 
magnitude. How then, asked Lorentz, for example, 179 was it possible for 
the human eye to see? Its pupil could admit only a small fraction of a 
quantum whereas, according to the basic tenets of the hypothesis, a whole 
quantum would be necessary to affect the retina. Does not the experiment 
of Newton’s rings, in which a ray of light divides by reflection into two 
parts which, after proceeding over different paths, reunite for interfer¬ 
ence, clearly demonstrate the divisibility of quanta? Inconsistencies like 
those raised by Lorentz and others were the first examples of the difficul¬ 
ties originating from an indiscriminate application and simultaneous use 
of the models of wave and particle. It was mainly because of these difficul¬ 
ties that Einstein’s corpuscular conception of light quanta met such strong 
opposition. 

Incidentally, there exists an interesting document which ostensibly 
puts on record the reaction toward Einstein’s notion of light quanta, prior 
to Millikan’s final empirical confirmation of the photoelectric equation, 
on the part of Germany’s most prominent physicists. To ensure the elec¬ 
tion of Einstein on June 12,1913, to membership in the Prussian Academy 
of Science, in replacement of the recently deceased J. H. van’t Hoff, the 
noted Dutch physical chemist and first Nobel laureate in chemistry, four 
of the most eminent German physicists, Planck, Warburg, Nemst, and 
Rubens, submitted to the Prussian Ministry of Education a petition in 
which they recommended Einstein for this vacancy. In this signed docu¬ 
ment they described Einstein’s work on the special theory of relativity, his 
contributions to the quantum theory of specific heats—which we shall 
discuss in the next section—and his treatment of the photoelectric and 
photochemical effects. In concluding their recommendation, they de¬ 
clared: “Summing up, we may say that there is hardly one among the great 
problems, in which modem physics is so rich, to which Einstein has not 
made an important contribution. That he may sometimes have missed the 
target in his speculations, as, for example, in his hypothesis of light quan¬ 
ta, cannot really be held too much against him, for it is not possible to 
introduce fundamentally new ideas, even in the most exact sciences, with¬ 
out occasionally taking a risk.” 180 It is instructive to compare this assess¬ 
ment of Einstein’s work on light quanta with its evaluation at a later 
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period as, for example, with K. T. Compton’s description of it as “a contri¬ 
bution to physical theory certainly comparable in importance, and thus 
far more useful in its applications than his more impressive and wider 
publicized general theory of relativity.” 181 

It should be understood that the ultimate reason for Einstein’s statisti¬ 
cal discovery of the duality of lights as wave and particle was Planck’s law 
of radiation. In fact, when Planck, in order to fit his theory with experi¬ 
ence, interpolated between 


d 2 S _ const 
~dU 2 ~ U 

which led to Wien’s law of radiation, and 

d 2 S _ const 
~dU 2 ~ U 2 


( 1 . 8 ) 


( 1 . 10 ) 


which led to the Rayleigh-Jeans formula, the resulting equation (1.11), 


d 2 S _ a 
~dU 2 ~ U(b+ U) 


( 1 . 11 ) 


was precisely an amalgamation of the wave and particle aspects of radi¬ 
ation, the first of its kind in the development of quantum theory, though, 
of course, not recognized as such by its author. 

This may be understood from the following analysis. If radiation were 
treated solely according to Eq. (1.10) or, equivalently, according to the 
Rayleigh-Jeans formula (1.9), a straightforward calculation of the energy 
fluctuation would yield only the second term in the sum of Eq. (1.19), 
corresponding to the wave picture of light. If, however, Eq. (1.8) or 
Wien’s law, Eq. (1.5), were taken as the point of departure, the fluctu¬ 
ation formula would be reduced to the first term of the sum in Eq. (1.19), 
corresponding to the particle picture of light. Alternatively, and without 
calculation, this conclusion also follows from the fact that for low fre¬ 
quencies, the range of validity of Eq. (1.10), the first term of the sum 
under discussion can be neglected in comparison with the second, while 
for high frequencies, where Eq. (1.8) is valid, Eq. (1.19) reduces to 
e 2 = Ehv, the fluctuation formula for the particles of an ideal gas. 

The reader will now understand our reference to Planck’s interpolation 
formula (1.11), announced on October 19, 1900, as containing “certain 
implications which, once recognized by Einstein, affected decisively the 
very foundations of physics.” 182 

It makes little sense, of course, to regard a certain day as “the birthday” 
of a development conceptually so complicated as that of quantum theory. 
However, an inquiry concerning the earliest formulation of a statement or 
formula which characterizes the fundamental idea of a theory seems to 
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have a place in a conceptual analysis such as ours. From this point of view 
we would prefer to consider October 19, 1900, as the birthday of our 
theory and not, as Max von Laue 183 and others suggested, December 14, 
1900. The following considerations will explain our choice. 

December 14, 1900, has usually been regarded as this date because on 
that day Planck introduced his constant h and in terms of it the discrete 
energy spectrum of a harmonic oscillator of given frequency. Now, it can 
be shown 184 that discontinuous change is not necessarily an essential char¬ 
acteristic of quantum theory. It ought also to be noted that as early as May 
18, 1899, a year and a half before December 14, 1900, Planck had read a 
paper 185 before the Berlin Academy in which he calculated on the basis of 
Wien’s radiation law and certain experimental data obtained by Kurl- 
baum 186 and Paschen 187 the numerical value of a constant which he called 
b and found that b = 6.885 X10 -27 cm 2 g/sec. He also recognized it as a 
fundamental natural constant. For, together with the velocity of light 
c = 3.00X10 10 cm/sec, the gravitational constant/ = 6.685 X10 -8 cm 3 / 
g sec 2 , and another radiation constant 188 a = 0.4818 X10“ 10 sec deg, it 
made it possible to introduce a system of natural units (“natiirliche Mass- 
einheiten”) “which in view of their independence of particular bodies or 
substances are of everlasting significance also beyond the limits of our 
global human civilization.” 189 Planck’s introduction of h , though under a 
different name and without reference to the physical dimension of ac¬ 
tion, 190 as well as its recognition as a fundamental natural constant, 
though not yet with respect to the discreteness of energy values, preceded 
considerably, as we see, the date of December 14, 1900. Since, however, 
the wave-particle duality may be regarded, in our view, as the essential 
characteristic of quantum theory, since this duality was implicitly con¬ 
tained, as we have shown following Einstein, in Planck’s paper of October 
19,1900, and since it was ultimately the interpolation formula (1.11) for 
the logical corroboration of which Planck turned to Boltzmann’s statistics 
and quantized the energy of the harmonic oscillator—or, in modem ter¬ 
minology, the energy of the field of electromagnetic harmonic vibra¬ 
tions—our choice of October 19,1900, as the birthday of quantum theory 
seems to be justified. 

It should be noted, however, that in 1978 the well-known historian and 
philosopher of science Thomas S. Kuhn challenged the standard view 
which regards 1900, the year in which Planck introduced the quantum of 
action, as the beginning of quantum physics. According to Kuhn, 
Planck’s reasoning in 1900 and at least until 1906 was completely classical 
and, in particular, the idea, energy must be treated as a discrete rather 
than a continuously varying quantity, played no role in Planck’s delibera¬ 
tions in those years, “...though the structure of the energy continuum is 
fixed by the energy element A,” argued Kuhn, “the motion of Planck’s 
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resonators remains continuous, both within the elements constituting that 
continuum and from one to the next... nothing in Planck’s published pa¬ 
pers, known manuscripts, or autobiographical fragments suggest that the 
idea of restricting resonator energies to a discrete set of values had even 
occurred to him as a possibility until others forced it upon him during 
1906 and the years following.” 1903 

This “historiographic heresy,” as Kuhn himself called his unconven¬ 
tional reinterpretation of Planck’s writings, attracted much attention. 
Thus, for example, one of the leading periodicals in the history of science, 
Isis , published a special symposium in which a noted historian of science, a 
distinguished philosopher of science and an important sociologist stated 
their reaction to Kuhn’s arguments. 19015 

Considering this symposium and the numerous reviews of Kuhn’s 
book, among them Peter Galison’s excellent analysis of the issue under 
discussion, 1900 we may say that the majority of historians of modem phys¬ 
ics retain 1900 as marking the beginning of quantum physics even if they 
concede that Planck was not fully aware of the real meaning of what he 
had been doing at that time and that it was due primarily to the indepen¬ 
dent work of Einstein and Ehrenfest in 1905 as well as to the influence of 
Lorentz, that physicists were finally “converted” to believe in the discon¬ 
tinuity of energy. 

In his apologetic essay “Revisiting Planck” Kuhn raised the question 
of why it should really matter whether Planck or Einstein discovered the 
quantum discontinuity. His answer is twofold. As historians of science we 
are interested in “getting the facts straight.” As philosophers of science— 
and here Kuhn related explicitly for the first and only time the discontin¬ 
uity issue to his much debated theory of scientific revolutions—we believe 
that “history can provide data to the philosopher of science and the episte- 
mologist, that it can influence views about the nature of knowledge and 
about the procedures to be deployed in its pursuit. Part of the appeal of the 
standard account of Planck’s discovery is the closeness with which it 
matches a still cherished view of the nature of science and its development. 
Though I appreciate both the charms and the functions of that view, 
understanding requires that it be recognized as a myth.” 190d 


1.4 Elaborations of the Concept of Quanta 

In the early development of quantum theory, as we have seen, two concep¬ 
tions of quanta had been proposed: Planck’s energy quanta, associated 
with the interaction between radiation and matter, and Einstein’s light 
quanta, associated with radiation in transit. Both were described by the 
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same equation e = hv. Yet in spite of this identity in formula, Planck’s 
conception was universally accepted as indispensable for further research 
while Einstein’s notion was generally rejected for reasons previously ex¬ 
plained. 

Also Planck refused to accept the idea of light quanta. Einstein, it will 
be recalled, drew attention to the logical inconsistency in Planck’s deriva¬ 
tion of the radiation law as being based on a simultaneous espousal and 
rejection of classical electrodynamics; 191 but, viewing Planck’s law as an 
empirically established fact, he saw in this inconsistency an argument 
against the unrestricted applicability of Maxwell’s electrodynamics and 
thus an argument in favor of his own ideas on light quanta. Now, in 
striking contrast to Einstein, Planck attempted to resolve this inconsis¬ 
tency by a stricter adherence to Maxwell’s electrodynamics. 

When Einstein raised his objections at the Salzburg meeting, Planck’s 
reply, in which he described Einstein’s “atomistic interpretation” of radi¬ 
ation and his “rejection of Maxwell’s equations” as “an as yet unnecessary 
step,” 192 already contained the germ of his subsequent modification in 
deriving his radiation law. Referring to Einstein’s thought experiment on 
radiation fluctuations and their effect on a movable mirror, 193 Planck 
commented that Einstein’s reasoning would be conclusive provided one 
really understood the interaction between radiation and matter; “but if 
this is not the case, the bridge which is needed to connect the motion of the 
mirror with the intensity of the incident radiation is missing. Indeed,” 
Planck continued, “the interaction between free electric energy in the 
vacuum and the motion of ponderable atoms, I think, is little understood. 
It is essentially the process of emission and absorption light.... But just 
emission and absorption are an obscure point on which we know so little. 
Perhaps we know something of absorption, but how about emission? One 
pictures it as produced by accelerations of electrons. But this is the 
weakest point of the whole electron theory. One assumes that the electron 
has a definite volume and a definite finite charge density ... in violation of 
the atomistic conception of electricity. These are not impossibilities but 
difficulties, and I almost wonder that no stronger objection has been 
voiced. It is at this very point, I believe, where the quantum theory can be 
useful. ...” 194 

It was precisely in connection with this issue and in this spirit that at 
the end of 1911 Planck completed his so-called “second theory,” which he 
presented to the German Physical Society on January 12,1912. In it 195 he 
proposed to derive the radiation law in a way which he thought “is free of 
inner inconsistencies” and which “does not depart from the core of the 
classical electrodynamics and electron theory more than is absolutely nec¬ 
essary in view of the undeniably irreconcilable differences with the quan¬ 
tum hypothesis.” 
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Contrary to his first theory and in closer agreement with Maxwell’s 
theory Planck now suggested that the absorption of electromagnetic ener¬ 
gy by oscillators is a continuous process. On the other hand, he contended, 
emission occurs only in integral energy quanta 6 and is a discontinuous 
process regulated by a law of chance; not every oscillator after having 
accumulated the amount 6 during the absorption necessarily emits this 
quantum, but only a fraction 77 of such oscillators does so. Planck now 
argued as follows. Of a total of N oscillators Nw n have at a given instant of 
time t energies between ne and (n + 1)6, where n = 0,1,2,... If dV/dt is 
the rate of increase of energy content per oscillator during the time inter¬ 
val dt , the energy threshold (n + 1)6 is passed by all those oscillators 
whose energy, at the beginning of dt , was between (n + 1)6 and 
(n + 1)6 — dV. On the assumption of a uniform distribution, Nw n dV/e 
is the number of oscillators within dV , and of these rjNw n dV/e emit their 
total energy while (1 — 7j)Nw n dV/e pass the aforementioned energy 
threshold to replace the Nw n +l dV/e oscillators which at the beginning 
of dt were in the energy interval from (« + 1 )6 to (« + 1)6 + dV. Under 
conditions of equilibrium (1 — rj)Nw n dV/e = Nw n + x dV /e or w n+x 
= w n (1 — 77 ). Thus w n = w 0 ( 1 — 77 )" and, since 2qNw„ = N, co 0 — 77 . 
The average energy of the oscillators between ne and (n + 1)6 being 
(n + \)e 9 the total energy E is 


E = ^Nw n (n + ^)€ = ^- — + 


V 


Hence the average energy per oscillator is 

u=-£■ + . 1=2. 

2 i) 

Planck now determined the dependence of U on the temperature T. The 
thermodynamic probability of the distribution is W = N\/Rq (Nw n )\ 
and hence the entropy is 

kN 

S = — kN log 77 — : —(1 — 77 ) log (1 — 77 ). 

V 

Calculating dS/dE = ( dS/dr]) ( dy/dE) = T~\ Planck obtained 
T~ l = — ( k/e ) log( 1 — 77 ) or 1 — 77 = exp( — e/kT ), so that finally 

hv 


2 77 2 exp(/zv/fc!T) — 1 


an expression which differed from Planck’s previous value of U (cf. p. 16) 
by the additive constant hv/ 2. In order to obtain the radiation law Planck 
had to introduce an additional hypothesis. He postulated that the ratio 
between the probabilities of emission and absence of emission is propor¬ 
tional to the reciprocal rate of the absorption: y/( 1 — 77 ) = a/(dV/dt). 
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Thus the more slowly an oscillator passes the critical threshold ne , the 
greater is the probability of emission. Since at equilibrium the brightness 
K v (cf. note 12) and consequently also the energy density u v = (8 ir/c)K v 
are proportional to dV /dt, Planck obtained 


«v = P 


i -v 

V 



exp 


hv 

~kT 


where the constant (3 by the use of the Rayleigh-Jeans formula [lim( v/ 
T) = 0] turned out to be 8 irhv*/c 3 , so that Planck’s original law of radi¬ 
ation (1.16) has been recovered. 

Although soon rejected by its author and replaced by a “third theory,” 
which regarded emission as well as absorption as continuous phenome¬ 
na, 196 Planck’s “second theory” deserves our attention for at least three 
important reasons. In the first place, from the new expression of the aver¬ 
age energy U of the harmonic oscillator Planck deduced that at absolute 
zero the energy is not zero, as implied by his original formula for U ., but 
equals \hv. The notion of “zero-point energy” thus made its first appear¬ 
ance in modem physics; though forged by an untenable theory, it was 
eventually, as is well known, vindicated by quantum mechanics. The oth¬ 
er point of interest is that Planck’s “second theory” contained probably 
the earliest suggestion, as far as quantum theory is concerned, that ele¬ 
mentary processes are subject to a law of chance. For Planck assumed that 
the oscillator, while absorbing energy continuously from the surrounding 
field of radiation, accumulates this energy and that its energy content may 
therefore have any value whatever; as soon, however, as its energy content 
reaches the amount hv, it has a definite probability of emitting the quan¬ 
tum hv. It should be emphasized that Planck’s conception of probability 
was the classical conception as used in the kinetic theory of gases. For he 
stated explicitly: “Not that causality is denied for emission; but the factors 
which determine emission causally seem to be so deeply hidden that for 
the time being their laws can be only statistically ascertained.” 197 Planck 
obviously regarded the use of the law of chance, which, as he said, “always 
signifies a renunciation of the completeness of causal interconnection,” 198 
merely as a provisional device. In fact, this was the main reason why in his 
own opinion his new theory was but a “hypothetical attempt” to reconcile 
the law of radiation with the foundations of Maxwell’s doctrine, and not a 
final solution of the problem. 

The third point of interest is his method of determining this probabili¬ 
ty. Having postulated that the ratio between the probabilities of emission 
and absence of emission is proportional to the reciprocal rate of the ab¬ 
sorption or, equivalently, that the ratio of the probability of no emission to 
that of emission is proportional to the intensity of the exciting vibra¬ 
tion, 199 Planck determined the constant of proportionality by recourse to 
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the limiting case of extremely strong radiations where Rayleigh’s formula 
and classical dynamics become valid. 200 Planck’s derivation of the con¬ 
stant of proportionality as proposed presently was probably the earliest 
instance in quantum theory of applying what more than ten years later 
became known as the “correspondence principle” and what proved to be 
immensely fruitful not only for the determination of constants through 
such limiting considerations but also for the inference of conclusions of a 
much wider scope. 

Those who rejected Einstein’s light quanta had, of course, to refute the 
statistical evidence he had adduced in favor of this conception. As long as 
only the experimental evidence of the existence of interference and diffrac¬ 
tion served as the basis for this rejection, no clear-cut decision could be 
made. Thus, various attempts to disprove Einstein’s conclusions were 
made at the turn of the first decade of the century. For example, in his 
lectures delivered at Columbia University in 1913 on “Recent Problems in 
Theoretical Physics,” 201 in which he rejected Einstein’s conception of 
light quanta for reasons similar to those adduced by Lorentz, 202 Wien 
attempted to invalidate Einstein’s conclusion regarding Eq. (1.19) as fol¬ 
lows. He pointed out that in virtue of Eq. (1.16), with the abbreviation 
x = hv/kT ’ 

Ehv _ c 3 E 2 
e x — 1 8 / irv 1 vdv 

an expression which he called B. Denoting Ehv briefly by A, so that 

A = (e* — 1)2?, Wien obtained e 7 = A -1- B = Be*. “This formula,” he 
concluded, “shows that there is no special reason to attribute the fluctu¬ 
ations to two causes which are independent of each other.” 203 

Those who rallied to support Einstein’s idea of light quanta thought it 
imperative to investigate their relation to Planck’s conception of quanta. 
An important advance in this direction was made when two simultaneous 
yet independent papers were read in 1911; Ehrenfest, in an address to the 
Physical Society of Petersburg, 204 and Natanson in a paper read before the 
Academy of Sciences in Cracow, 205 showed that Einstein’s hypothesis of 
noninteracting light quanta does not lead to Planck’s radiation law (1.16) 
but only to Wien’s law, Eq. (1.5), which was known to be a special case of 
the former. 

In his analysis of the precise assumptions underlying Planck’s combin¬ 
atorial procedure in which P energy elements e were distributed among N 
“receptacles of energy” so that Nj receptacles each contain j energy ele¬ 
ments, subject to the restrictions 2A, = N and 2/A, = P, Natanson 
called the correlation of je with N j9 that is, sorting the receptacles accord¬ 
ing to their energy content, a “mode of distribution.” In setting up a 
“mode of distribution,” Natanson emphasized, no account is taken of a 
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possible “identifiability” or distinguishability, as we would say today, of 
receptacles or of energy elements. However, as soon as the former are 
regarded as individually identifiable, every given “mode of distribution” 
ramifies into a number of “modes of collocation” which specifies the num¬ 
ber of energy elements in each individual receptacle. Finally, if the energy 
elements are also considered as identifiable, each “mode of distribution” 
splits into a number of “modes of association” which associates individual 
energy elements with individual receptacles. Natanson then pointed out 
that the thermodynamic probability of a given “mode of distribution” 
depends notably on whether all “modes of association” or all “modes of 
collocation” are regarded as equally probable, and he showed that Planck, 
in contrast to Einstein, adopted the latter alternative. It was in Natanson’s 
analysis of Planck’s statistical procedure that the problem of the distin¬ 
guishability of elementary entities was raised for the first time. 

In his article of 1911 and later in his appendix to the paper quoted in 
Ref. 87, Ehrenfest distinguished between “identical and discrete quanta” 
(“gleichartige, von einander losgeloste Quanten”) such as Einstein’s 
quanta of light, and “nondisjointed quanta” (“nicht von einander losge¬ 
loste Quanten”) such as Planck’s. He based this distinction on their differ¬ 
ent statistical behavior. For, if A x is the number of ways of distributing P 
energy elements of the former kind among receptacles or cells in space and 
A 2 the corresponding number for N 2 receptacles, Einstein’s reasoning, as 
Eq. (1.18) shows, postulates that A X A 2 = N X :N 2 since v = N x units of 
volume and v 0 = N 2 such units. If, however, A x and A 2 refer to energy 
quanta as conceived by Planck, Ehrenfest continued, then 

+ P — 1)! (N 2 + P — 1)! 

1 2 (N x — 1)!P! ' (JV 2 -DIP! 

Hence, Ehrenfest concluded, “nondisjointed quanta” correspond to 
Planck’s law of radiation whereas “disjoint quanta” as conceived by Ein¬ 
stein lead to Wien’s law. It is clear that Ehrenfest’s distinction is essential¬ 
ly that between the number of ways in which P distinguishable objects 
may be placed into N receptacles and the number of ways in which P 
indistinguishable objects may be placed into N receptacles. An attempt to 
modify the Einsteinian conception of quanta so as to reconcile it with 
Planck’s law of radiation was made, also in 1911, by Joffe. 206 Similarly, a 
few years later, Krutkow showed that the two assumptions that any reso¬ 
nator of frequency v possesses only energy values 0, hv, 2hv ,... and that 
these energy values result from “appositions” (“Aneinanderlagerun- 
gen”) of discrete energy elements hv lead to Wien’s and not to Planck’s 
law of radiation; to obtain Planck’s law, the second assumption has to be 
abandoned. The question of what conditions have to be imposed upon 
quanta was the subject of a prolonged dispute between Krutkow and 
Wolfke. 207 



42 


Conceptual Development of Quantum Mechanics 


For the time being, none of these considerations succeeded in clarifying 
a still obscure situation. No wonder that Lorentz, in the opening state¬ 
ment of his address to the Solvay Congress held in Brussels from October 
30 to November 3,1911, still called black-body radiation a “most myster¬ 
ious phenomenon and a most difficult one to unveil.” 208 The Solvay Con¬ 
gress of 1911, attended among others by Marcel Brillouin, Maurice de 
Broglie, Einstein, Jeans, Kamerlingh Onnes, Nemst, Planck, Rubens, 
Rutherford, Sommerfeld, Warburg, and Wien, was convened with the 
declared purpose of attempting to achieve some clarification of the com¬ 
plicated conception of quanta. On one point everybody was agreed: that, 
as Nemst put it at the opening of the Congress, “the fundamental and 
fruitful ideas of Planck and Einstein must serve as the basis for our discus¬ 
sions; they may be modified and elaborated, but they cannot be ig¬ 
nored.” 209 In fact, it was only a few weeks after the Congress that Poincare 
published his paper “On the theory of quanta” 210 in which he conclusively 
demonstrated that “the hypothesis of quanta is the only one which leads 
to Planck’s law of radiation” and that the existence of a discontinuity in 
the probability function of the energy distribution is a necessary conse¬ 
quence of any assumed radiation law which leads to a finite total radi¬ 
ation. 

Planck 211 contributed an important paper to the Congress in which he 
showed how classical statistical mechanics had to be modified in order to 
yield his radiation law (1.16) instead of the Rayleigh-Jeans law (1.9). 
According to the classic theory of Gibbs 212 the probability of finding the 
representative point for the state of a system in the element dq dp of the 
phase space is given by e~ E/kT dq dp/ffe ~ E/kT dq dp , where the energy E 
is a function of the coordinate q and the generalized momentum p. For the 
linear harmonic oscillator E = p 2 /2m + li^^mq 2 . Classical mechanics, 
as a simple integration shows, yields for the average energy U — U 
ffEe~ E/kT dq dp/ffe~ E/kT dq dp = kT and hence leads to Eq. (1.9). If, 
however, it is postulated that E can take on only the discrete values E n 
— ne = nhv with n = 0, 1, 2,..., then 213 


2E n e- E ' kT 
U = - 





€ 

e €/kT - 1 


in agreement with Eq. (1.16). It was in view of this elementary derivation 
of the radiation formula that Planck made for the first time the explicit 
statement that the energy, and not only the average energy, of an oscillator 
of frequency v is an integral multiple of the energy element hv. On the 
basis of these considerations Planck interpreted h as the finite extension of 
the elementary area in phase space. The motions of linear harmonic oscil- 
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lators of energy E = p 2 /2m + y3q* are described in phase space by a fam¬ 
ily of concentric ellipses with semiaxes (2 E //3) 112 and (2 mE) 112 . If it is 
now assumed, continued Planck, that these ellipses, instead of forming a 
continuum, are separated from each other so that the intervening areas of 
phase space are of magnitude h or, in other words, that the «th ellipse 
encloses an area J f dq dp = nh , then the energies of the oscillators are 
multiples of Av. 214 

These considerations, which prima facie might be regarded as merely 
graphical representations, constituted an important turning point in the 
development of quantum theory. They led Planck to the far-reaching idea 
that the appearance of energy quanta is the result of a much more pro¬ 
found and general principle. Planck consequently declared that the con¬ 
cept of energy quanta, which until then had been at the forefront of theo¬ 
retical research, was in fact only of secondary importance since it is the 
consequence of a more basic condition 

E+e 

j j dqdp = h 

E 

“One should therefore confine oneself to the principle that the elementary 
region of probability h has an ascertainable finite value and avoid all 
further speculation about the physical significance of this remarkable con¬ 
stant.” 215 

Planck’s address, in fact, was an appeal to abandon all attempts at 
finding a classical interpretation of h and to search, instead, for a clarifica¬ 
tion of the basic principles which underlie the manifestation of energy 
quanta. It was generally admitted that the revisions which had to be intro¬ 
duced into classical statistics were only an indication of the need for a far- 
reaching modification of classical mechanics whose details were far from 
being understood. Planck expressed this feeling in the opening of his ad¬ 
dress to the Congress when he said: “The framework of classical dynam¬ 
ics, even if combined with the Lorentz-Einstein principle of relativity, is 
obviously too narrow to account also for all those physical phenomena 
which are not directly accessible to our coarse senses.” 216 This statement 
by Planck is one of the earliest, if not the earliest, renunciations of the 
universal validity of classical mechanics. Eucken, the editor of the Ger¬ 
man edition of the Proceedings of the Congress, formulated the problem as 
follows: 217 “Is quantum theory a branch of conventional mechanics; or, 
conversely, is traditional mechanics a special case of a more general me¬ 
chanics (which comprises, or is identical with, quantum theory); or, fin¬ 
ally, is quantum theory a science altogether outside mechanics?” 

The first step in this groping toward a new mechanics was made by 
Arnold Sommerfeld when he presented a paper on “The application of 
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Planck’s theory of the quantum of action to molecular physics” 218 to the 
Solvay Congress. Referring to the term “quantum of action,” which in his 
view was “a most fortunate” name, Sommerfeld suggested that Planck’s h 
should be related, not merely nominally, to Hamilton’s “action function” 
SL dt , where L denotes the kinetic potential, viz., the difference between 
the kinetic energy T and the potential energy V of the dynamic system 
under consideration. Sommerfeld proposed the fundamental hypothesis 
that in every elementary process the atom gains or loses a definite amount 
W of action, where W = L dt = h /2ir and r is the duration of the 
process. Like every basic principle the statement could not, of course, be 
proved in a rigorous manner. Sommerfeld could only vindicate it by a 
number of arguments such as its relativistic invariance or the fact that it 
easily explained the production of x-rays or y rays, arguments which in his 
opinion were sufficiently convincing. Sommerfeld also showed that his 
hypothesis readily accounted for the photoelectric effect. Let us briefly 
describe Sommerfeld’s derivation of Einstein’s photoelectric equation. 
The electron, subject to a semielastic force — fx with x = 0 as its equilibri¬ 
um position at time t = 0, is supposed to be acted upon by the external 
electric force F= E cos nt , where v = n/2ir denotes the frequency of the 
monochromatic incident radiation. The equation of motion is mx + fx 
= eE cos nt y the kinetic energy T = \mx 2 > and the potential energy 
V = fx 2 . According to the principle proposed, the electron would be lib¬ 
erated as soon as W, which after partial integration and in view of the 
equation of motion can be written as \mxx — \e$ T 0 xF dt, reaches the value 
h /2ir. The first term \mx jc , which refers to the instant t = r (it vanishes 
for t = 0), is the kinetic energy of the electron at that instant divided by 
the proper circular frequency n 0 = 2 ttv 0 = (f Vm ) 112 . This can be seen as 
follows. W, like x and jc , is a fast-oscillating function with slowly varying 
amplitude and therefore attains the value h /2ir for the first time in the 
vicinity of a maximum (otherwise it would already have reached this 
value in the preceding oscillation). It follows for t = r that d W /dt = 0 or 
T = V, which implies jc = HqX and \mxx = T/n 0 . Sommerfeld then 
showed that for resonance, when n = n 0 , the second term in the expres¬ 
sion for W, viz., the time virial of the external force, can be neglected in 
comparison with T. Consequently, W=h /2 tt = T/n 0 =T/n or 
T = Av. 219 But this, apart from the work function, is Einstein’s photoelec¬ 
tric equation. The experimentally established fact that the energy of the 
ejected photoelectron depends on the frequency v of the incident radi¬ 
ation, but not on its intensity, was thus for Sommerfeld a direct conse¬ 
quence of his newly proposed fundamental hypothesis or, conversely, 
could be regarded as evidence for its validity. 

Sommerfeld’s principle, although soon discarded in the subsequent 
development of the theory, has been described at some length because of 
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its historical importance in having initiated an approach different from all 
methods proposed so far. For so far, quantum phenomena had been treat¬ 
ed by what may be described most appropriately as quantum statistics of 
the harmonic oscillator. It originated from Planck’s study of black-body 
radiation and accounted for this phenomenon. But almost no other field in 
physics was affected by it. The only exception, prior to the first Solvay 
Congress, will now be discussed in some detail. 


1.5 Applications of Quantum Conceptions to The Molecular 

Kinetic Theory 

Reviewing the status of quantum theory as discussed so far, we may say 
that Planck’s theory, which, as we have seen, was from the mathematical 
point of view ultimately the outcome of an interpolation, was from the 
methodological point of view, until the first Solvay Congress, almost a 
theory ad hoc. In fact, until 1907 it was purely ad hoc , constructed to 
account for black-body radiation and without any relevance to other phe¬ 
nomena in physics. If it is agreed that a physical theory becomes signifi¬ 
cant only if it also accounts for phenomena other than those for whose 
explanation it was conceived, then it was Einstein’s paper of 1907 on 
“Planck’s theory of radiation and the theory of specific heats” 220 which 
endowed Planck’s work with physical significance. For Einstein showed 
that quantum conceptions can be successfully applied also outside the 
area of radiation problems when he explained certain inconsistencies in 
the classical molecular kinetic theory on the basis of Planck’s theory. 

Referring to Planck’s result that the mechanism of energy transfer 
admits only of energy values that are integral multiples of hv, Einstein 
declared: “We should not be satisfied, I believe, with this conclusion. For 
the following question forces itself upon our mind: If it is true that the 
elementary oscillators that are used in the theory of energy transfer be¬ 
tween radiation and matter cannot be interpreted in terms of the present 
molecular kinetic theory, must we then not also modify our theory for the 
other oscillators which are used in the molecular theory of heat? In my 
opinion there can be no doubt about the answer. If Planck’s theory of 
radiation really strikes the core of the matter, then it should be expected 
that in other areas of the theory of heat contradictions also exist between 
the present molecular kinetic theory and experience which can be resolved 
by the method just proposed.” 221 

Such a contradiction was known to exist in the theory of specific heats. 
During the fifty years following the statement by Dulong and Petit that 
“the atoms of all simple bodies have exactly the same heat capacity,” 222 
nobody, it seems, doubted the correctness of this “law,” and Regnault’s 223 
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extensive measurements of specific heats only seemed to confirm it. It 
should be noted, however, that almost all these measurements were car¬ 
ried out at room temperature or above since analogous experiments at 
lower temperatures at that time would have posed insurmountable techni¬ 
cal difficulties. The first conflict occurred in 1872 when Weber 224 mea¬ 
sured the specific heat of diamond below room temperature and found 
that at — 50 °C its molar heat was only 0.76 cal/mole deg. Prior to 1896 
only two other experiments seem to have been significant in this respect, 
those performed by Pebal and Jahn 225 and those carried out by Schiiz, 226 
the former working with antimony and the latter with alloys and amal¬ 
gams of low melting points. But in spite of the scarcity of empirical data of 
this kind, Boltzmann 227 and, particularly, Richarz 228 had already formu¬ 
lated a theory of specific heats which was to account also for the excep¬ 
tions of the Dulong-Petit law. In 1895 in his ice-machine factory in Mu¬ 
nich Linde invented 229 the process of liquefying air by utilizing the 
Joule-Thomson effect and thus made further progress possible. In fact, 
this invention advanced the technique of measuring specific heats at lower 
temperatures to such an extent that only three years later, in 1898, at the 
Institute of Physics of the Berlin University, Behn 230 was able to sum up 
his findings with these words: “The graphical representation of the de¬ 
crease of specific heats with temperature seems to suggest that all curves 
converge at absolute zero so that the specific heats at that temperature 
have the same extremely small value (0?). However, this result can here 
be expressed only as an assumption.” 231 Further investigations by Til- 
den 232 and Dewar 233 seemed to conform with this result. 

This, then, was the experimental situation prior to Einstein’s paper of 
1907. From the theoretical point of view, the Dulong-Petit rule according 
to which the product of the atomic or molecular weight and the specific 
heat is a constant for all solids (approximately 6 cal/mole) was, of course, 
perfectly accounted for by classical physics. For, according to the law of 
equipartition and on the assumption that for small vibrations the potential 
energy of the particles is a quadratic function of their displacements, each 
of the TV particles with three degrees of freedom (A is Avogadro’s num¬ 
ber) possesses on the average the total energy 3 kT at temperature T. The 
energy per mole is consequently 3 RT and the specific heat per mole ap¬ 
proximately 6 cal. 

It was Einstein who realized at the end of 1906 that the systematic 
diminution of the specific heats of solids with decreasing temperature can 
be accounted for on the basis of Planck’s quantum conceptions. Neglect¬ 
ing atomic interactions and assuming that all the atoms in the solid are 
oscillating with the same frequency, he applied Eq. (1.16) for the mean 
energy of the atom and obtained for the energy per mole the expression 
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3 Nhv 

exp(fiv/T) — 1 

from which he determined the specific heat per mole 


c _ dE_ _ 3R (ygv/ T) 2 exp (0v/ T) j 2 0) 

v dT [exp(/?v/7’) — l] 2 

where P=h /k, a formula which shows that C v decreases with T but is 
equal to 3 R at sufficiently high temperatures. 

Einstein’s formula (1.20) was found to fit experimental data remark¬ 
ably well. It was only at extremely low temperatures that it showed small 
deviations from the observed results. The reason for this discrepancy, as 
Einstein 234 later recognized himself, was the assumption that the vibra¬ 
tions of the atoms in the solid all have the same frequency v. Debye’s 
“continuum theory” 235 of the specific heat, which treated the lattice of the 
solid as a continuum with a continuous spectrum of vibrational frequen¬ 
cies cut off at a characteristic maximum and which led to the famous T 3 
law, and Bom’s modified cutoff procedure 236 in accordance with his theo¬ 
ry of lattice vibrations, developed in collaboration with von Karman, were 
essentially only refinements of Einstein’s approach. 

Einstein’s work on specific heats was important not only, as we have 
pointed out, from the methodological point of view in so far as it extended 
the applicability of quantum conceptions to the molecular kinetic theory, 
but also for the following reason. 237 Walther Nemst, as we know from 
various sources, 238 initially disliked quantum theory, claiming that it was 
“really nothing else than an interpolation formula.” 239 Later, however, 
when his attention was drawn to the exact determination of specific heats 
at low temperatures in connection with his discovery of the famous 
theorem which carries his name—and which, as we know today from 
quantum-statistical mechanics, is indeed but a macroscopic manifestation 
of quantum effects—and when his experiments and those of his collabora¬ 
tors, such as Eucken, seemed to agree with Einstein’s formula (1.20), he 
changed his mind and called Planck’s work “a most ingenious and fruitful 
theory.” 240 In a paper presented to the Berlin Academy on January 26, 
1911, Nemst declared: “At present, quantum theory is essentially only a 
rule for calculation, of apparently a very strange, one might even say 
grotesque, nature; but it has proven so fruitful by the work of Planck, as 
far as radiation is concerned, and by the work of Einstein, as far as molecu¬ 
lar mechanics is concerned,... that it is the duty of science to take it seri¬ 
ously and to subject it to careful investigations.” 241 

Thus, Nemst thought it advisable to organize an international meeting 
of prominent physicists with the exclusive purpose of discussing the basis 
problems and issues of quantum theory. When the Belgian industrialist 
Ernest Solvay offered his generous support, it was primarily due to 
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Nemst’s initiative that the meeting convened and became an important 
event in the history of physics. Ultimately, as we saw, Einstein’s work on 
specific heat was, though only indirectly and in a roundabout way, one of 
the factors responsible for the convocation of the first Solvay Congress. 242 
In a similar way it also gave rise to the earliest attempt at a new approach 
to quantum theory, an approach which later proved to be important from 
the didactic point of view. 

The trend of Nemst’s work in physical chemistry did not direct his 
interest to the radiative aspects of quantum physics, aspects which, most 
probably, he never studied thoroughly. A careful analysis of Nemst’s 
reactions and comments during the discussions of the various reports at 
the Congress would only corroborate this contention. On the other hand, 
Nemst was deeply interested in the question whether quantum physics 
could not be developed on a basis independent of the study of black-body 
radiation and in such a way that in its logical structure considerations 
pertaining to the molecular kinetic theory gain logical priority over those 
pertaining to radiation phenomena. Indeed, one of Nemst’s papers, enti¬ 
tled ‘‘Specific heat and quantum theory,” 243 written in collaboration with 
Lindemann, begins with these words: “In a recently published investiga¬ 
tion one of us [it was Nemst] has given a representation of quantum 
theory which, following Einstein, considers radiative phenomena as only 
secondary circumstances (‘begleitende Umstande’) and takes as its im¬ 
mediate point of departure the atomic vibrations.” 244 It was the earliest 
attempt to approach quantum physics from the molecular kinetic theory. 
Compared with the traditional introduction to quantum physics via the 
theory of black-body radiation, this alternative approach is regarded as 
preferable from the didactic viewpoint by all those teachers of modem 
physics who consider it unfortunate for the logical development of our 
subject that its history began with so complicated a concept as Planck’s 
quantization of the electromagnetic vibrations in an isothermal cavity. 245 
Nemst’s point of view did not gain prominence at the Congress, as the title 
of the official proceedings, “La Theorie du Rayonnement et les Quanta” 
or that of the German version, “Die Theorie der Strahlung und der Quan- 
ten,” indicates. 

The significance of this meeting for the conceptual development of our 
theory should have become evident from our preceding discussion of its 
deliberations. But apart from its general importance for its participants 
and its appeal to their concerted effort to find a new mechanics of a much 
wider scope than that of the harmonic oscillator, it seems also to have had 
a decisive effect in an indirect way on others who did not actively partici¬ 
pate. Bohr 246 recalled how impressed he had been by Rutherford’s “vivid 
account of the discussions” when he met Rutherford in Manchester in 
1911, shortly after Rutherford had returned from Brussels. In his autobio- 
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graphical notes, 247 Louis de Broglie described with what ardor and enthu¬ 
siasm he studied the text of the discussions which his older brother Maur¬ 
ice, one of the scientific secretaries of the meeting, was editing for 
publication. As a result of this, he declared, “I decided to devote all my 
efforts to investigate the real nature of the mysterious quanta that Planck 
had introduced into theoretical physics ten years earlier.” 248 

It is convenient to define as the first phase in the development of quan¬ 
tum theory the period in which all quantum conceptions and principles 
proposed referred exclusively to black-body radiation or harmonic vibra¬ 
tions. It is the period of the early quantum statistics of the harmonic 
oscillator. If we accept this definition, we may regard the Solvay Congress 
of 1911 as the closing act of this phase or as the prelude of a new period. 

In concluding this chapter, we should note that the first stage in the 
conceptual development of quantum theory, although devoid of any far- 
reaching or powerful theories, serves as an outstanding example of human 
ingenuity. For it showed how far man’s intellect can penetrate into the 
secrets of nature on the basis of comparatively inconspicuous evidence. 
Most of the concepts formed at this stage will be vindicated, as we shall 
see, by the later development of the theory, though on different grounds. 
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CHAPTER 2 

Early Applications of Quantum 
Conceptions to Line Spectra 


2.1 Regularities in Line Spectra 

In the preceding chapter we saw how the study of the single physical 
phenomenon of black-body radiation led to the conceptions of quanta and 
to the quantum statistics of the harmonic oscillator, and thus to results 
which defied the principles of classical mechanics and, in particular the 
equipartition theorem. It was generally agreed that classical physics was 
incapable of accounting for atomic or molecular processes. But how to 
modify classical physics by new conceptions was as yet an open question. 

It was clear that further progress was contingent upon a broadening of 
the scope of pertinent physical phenomena. Indeed, it was the study of the 
discrete spectra of the chemical elements which constituted the next stage 
of development after the study of the continuous spectra 1 of thermal radi¬ 
ation and contributed decidedly to the development of quantum theory. 

Ever since Newton laid the foundations of spectroscopy in 1666, the 
question concerning the origin of spectral lines had been regarded as an 
important problem in physics. Late-nineteenth-century physicists inter¬ 
preted the spectrum of a given element in a way analogous to the acoustic 
vibrations of a sound-producing body. They assumed that the whole spec¬ 
trum of a given element originated in the natural periods of oscillations of 
one and the same atom. Thus, to mention only one example, Herschel 
inferred from Balmer’s famous formula that atomic processes resemble 
acoustic phenomena in organ pipes. 2 

Mitscherlich 3 was the first to point out that spectroscopy should be 
regarded not only as a method of chemical analysis—spectroanalysis had 
just recorded its first spectacular achievements—but also as a clue to the 
secrets of the inner structure of the atom and the molecule. His own 
research, however, was confined to a merely qualitative comparison of 
different spectra of elements and compounds. Five years later, Mascart 4 
was the first to draw attention to the existence of definite arithmetical 
relations between the wavelengths of certain lines in the same spectrum. 
He interpreted the double lines in the spectrum of sodium and the triplets 
in that of magnesium as harmonic vibrations. Inspired by Mascart’s re- 
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marks, Lecoq de Boisbaudran 5 in 1869 thought he had found that the 
spectral lines of nitrogen, whose wavelengths he measured in units of 10“ 6 
mm, exhibit similar harmonic relations. He tried to show that this spec¬ 
trum is made up of two sets of bands, one extending from the red to the 
green and the other from the green to the violet, and that each band of the 
second set had a wavelength which was in the ratio of three to four with 
that of a corresponding band of the first set. Although soon disproved by 

o 

R. Thalen, a pupil of Angstrom, Lecoq’s paper inaugurated an intensive 
search for numerical regularities in spectra. 6 The first study of the spec¬ 
trum of hydrogen, from this point of view, was made by G. Johnstone 
Stoney. In accordance with his basic assumption “that the lines in the 
spectra of gases are to be referred to periodic motions within the individ¬ 
ual molecule, and not to the irregular journeys of the molecules amongst 
one another,” 7 Stoney declared that “one periodic motion in the molecule 
of the incandescent gas may be the source of a whole series of lines in the 
spectrum of the gas,” 8 an idea which, if mathematically formulated, 
would have led him to a Fourier analysis of the motion. To substantiate his 
claim, Stoney referred to Angstrom’s 9 famous measurements of the wave¬ 
lengths of the four then known hydrogen spectral lines: 6562.10 X 10~ 7 
mm, 4860.74X 10 -7 mm, 4340.10X 10 -7 mm, and 4101.2X 10 -7 mm. 
Stoney contended that three of these lines, namely, those with wave¬ 
lengths, if reduced to vacuum, k x = 6563.93 X10 -7 mm, 
A 2 = 4862.11 X 10 -7 mm, and k 3 = 4102.37 X 10 -7 mm, “are to be re¬ 
ferred to the same motion in the molecule of the gas” 10 and are “the 20th, 
27th and 32nd harmonics of a fundamental vibration whose wavelength in 
vacuo” is A 0 = 0.13127714 mm. 11 

Although forced to admit that “the other harmonics (19th, 21st, 
22nd) are not found in this spectrum of hydrogen,” Stoney was confident 
that further examinations would eventually establish additional harmon¬ 
ics. 12 In carrying out this research, subsequently with the assistance of Dr. 
Reynolds of Dublin, 13 Stoney found it “convenient to refer the positions of 
all lines in the spectrum to a scale of reciprocals of the wave-lengths” 14 
rather than to that of wavelengths used so far. His reasons for this innova¬ 
tion were as follows: “This scale has the great convenience, for the pur¬ 
poses of the investigation, that a system of lines with periodic times that 
are harmonics of one periodic time are equidistant upon it; and it has the 
further convenience, which recommends it for general use, that it resem¬ 
bles the spectrum as seen in the spectroscope much more closely than the 

o 

scale of direct wave-lengths used by Angstrom in his classic map.” And 
Stoney continued: “If, then, k be the wave-number of a fundamental mo¬ 
tion in the aether, its wave-length will be \/k of a millimeter, and its 
harmonics will have the wave-lengths 1/2 k, 1/3 k, etc., in other words, 
they occupy the positions 2k, 3k, etc., upon the new map.” 
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Stoney—the same Stoney who twenty-three years later introduced the 
term “electron” into physics—thus was the first to define the subsequent¬ 
ly so important notion “wave-number.” Even his original notation (k) is 
common usage in modem theoretical physics, where it generally de¬ 
notes—for the sake of further computational convenience—the number 
of waves in 2ir units of length (2ir cm) rather than per millimeter as 
proposed by Stoney. 

It was in the course of similar investigations that Liveing and Dewar 15 
discovered in the spectra of sodium and potassium the existence of two 
series of lines, one of sharp and one of diffuse lines, and established 16 the 
existence of homologous series, that is, series of lines of the same type in 
chemically analogous elements. At the same time, Arthur Schuster 
stressed the importance of the search for “an empirical law connecting the 
different periods of vibration in which we know one and the same mole¬ 
cule to be capable of swinging.” 17 He even applied the theory of probabili¬ 
ty to calculate the accidental occurrence of harmonic ratios on the as¬ 
sumptions that no such law exists and that all lines are distributed at 
random. Having thus tested the statistical significance of the available 
data, Schuster 18 concluded that “most probably some law hitherto undis¬ 
covered exists which in special cases resolves itself into the law of harmon¬ 
ic ratios.” 

Three years later, such an empirical formula, the first to comprise 
correctly all lines of a spectral series, was published by Johann Jakob 
Balmer,a schoolteacher in Basel. 19 His first paper, published in the Ver- 
handlungen der Naturforschenden Gesellschaft in Basel (Proceedings of 
the Scientific Society of Basel) for the year 1885, contained the prophetic 
words: “It appears to me that hydrogen.. .more than any other substance 
is destined to open new paths to the knowledge of the structure of matter 
and its properties. In this respect the numerical relations among the wave¬ 
lengths of the first four hydrogen spectral lines should attract our atten¬ 
tion particularly.” 20 Balmer noticed that the wavelengths of the four hy- 

o 

drogen lines as measured by Angstrom could be represented in terms of a 
“basic number” h = 3645.6X 10 -7 as \h> f/z, ^/z, and § h or, equivalently, 
as §/z, ||/z, §f/z, and ^/z. He thus saw that the numerators form the sequence 
3 2 , 4 2 , 5 2 , 6 2 whereas the corresponding denominators are the differences 
of squares 3 2 — 2 2 , 4 2 — 2 2 , 5 2 — 2 2 , 6 2 — 2 2 . He therefore expressed the 
wavelengths of these lines by the formula 

X = h -^-—-mm (2.1) 

m 2 - 2 2 

where h = 3645.6X 10 -7 and m = 3, 4, 5, 6. Balmer also predicted the 
existence of a fifth line wavelength 3969.65 X 10 -7 mm. Informed by von 
Hagenbach that this line and other additional lines had indeed been dis- 



68 


Conceptual Development of Quantum Mechanics 


covered by Huggins, Balmer showed in a second paper 21 that his formula 
applied to all twelve then known hydrogen lines. He also predicted cor¬ 
rectly that in the series which subsequently carried his name, no lines of 
wavelength longer than 6562 X10 -7 mm would ever be discovered and 
that the series would converge at 3645.6X10 -7 mm. The agreement be¬ 
tween the calculated and the observed values of the wavelengths was ex¬ 
tremely close in the visible region, but for shorter wavelengths slight sys¬ 
tematic discrepancies were evident. In view of these discrepancies Balmer 
expressed some doubts whether the fault lay with the formula or with the 
data. 

Balmer’s discovery had immediate repercussions and gave new impe¬ 
tus to a most intensive search for other regularities in spectra. In an ad¬ 
dress delivered before the British Association for the Advancement of 
Science held in Bath in 1888, Runge 22 announced that in certain spectra 
he had discovered a number of harmonic series of lines whose wavelengths 
conform to formulas of the type 


1 be 

— = aA -I- 

A mm 2 


or 




where a , b , c, a\ b ', c ' are constants and m is an integer. Balmer’s formula 
(2.1), he pointed out, was a special case of the first type mentioned, name¬ 
ly, when a = A -1 , b = 0, and c = — Ah _1 . In a detailed study on the 
structure of emission spectra, published in the Transactions of the Royal 
Swedish Academy in 1890, Rydberg 23 claimed he had been using Balmer’s 
formula long before its publication. Stating as a fundamental principle 
that “in the spectra of all the elements analyzed so far there are series of 
rays whose wavelengths or wave numbers are functions of consecutive 
integral numbers,” 24 Rydberg expressed these functions by a general for¬ 
mula n = n Q — No/(m + fi) 2 , where n was the wave number [“nombre 
des longueurs d’onde sur l’unite de longueur (1 cm) ” ], n 0 and fi constants 
for each series, and N 0 , later called “Rydberg’s constant” and denoted by 
R f a constant common to all series and to all elements. Having shown that 
Balmer’s formula was a special case with n 0 = h~ x , N 0 = 4h~\ and 

o 

/i = 0, Rydberg calculated, on the basis of Angstrom’s measurements of 
the first four hydrogen lines, the value of N 0 as 109721.6 cm - *. 

Referring to the discoveries of Liveing and Dewar, who identified the 
principal, sharp (“second subordinate series”) and diffuse (“first subor¬ 
dinate series”) series in the spectra of the alkali metals, and to the work of 
Hartley, 25 who observed recurrent frequency differences for the compo¬ 
nents of certain doublet and triplet series (“Hartley’s Law”), Rydberg 
showed that the wave numbers of the lines in these series can be expressed 
as functions of an integer m as follows: the component with the longer 
wavelength in each doublet of the principal series could be expressed by 
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n = n 0 — No/(m +/?i) 2 and the other component by n = n 0 — Nq/ 
(m + p 2 ) 2 \ correspondingly, the two components of each line in the first 
subordinate series could be expressed by n = n' 0 — No/(m + d) 2 and 
n = n’o — No/(m + d) 2 , and those of the second subordinate series by 
n — riQ— No/(m + s ) 2 and n = Hq — No/(m + s ) 2 . In 1896 Rydberg 26 
and (independently) Schuster 27 discovered that the difference between 
the limit of the principal series and the common limit to the two other 
series coincided with the wave number of the first line of the principal 
series, that is, that the limits n’ Q and n% of the subordinate series coincided 
with Nq/{ 2 + p 2 ) 2 and Nq/{2 + p x ) 2 , and similarly, that the limit of the 
principal series coincided with Nq/( 1 +^) 2 . The series were therefore 
represented by the formulas 


(l+s) 2 (m+p) 2 

with p = Pi orp 2 and m = 2, 3,.. .for the two components of the principal 
series, 


n = 


N 0 


N 0 


and n = 


No 


No 


(2 +/> 2 ) 2 (m + d) 2 (2 +Pi) 2 (m + d ) 2 

with m = 3, 4,.. .for those of the diffuse series, and finally 


it = 


No 


No 


and n = 


N, 


o 


No 


( 2+p 2 ) 2 (m + s) 2 “ (2 +Pt) 2 (m + s ) 2 

with m = 2, 3,.. .for those components of the sharp series. In 1907 Berg- 
mann 28 found in the infrared spectra of potassium, rubidium, and cesium 
a fourth series whose convergence limit coincided with Nq /(3 + d) 2 and 
whose wave numbers were expressed by 


(3 + d) 2 (m +f) 2 

where m = 4,5,... .Some of the lines of this series, which was later called 
the “fundamental series” or “Bergmann series,” had been discovered pre¬ 
viously by Saunders. 29 

It occurred to Rydberg 30 and was later explicitly stated as a fundamen¬ 
tal principle by Ritz 31 that the frequency of every spectral line of an ele¬ 
ment could be expressed as the difference between two terms or “spectral 
terms,” each of which contained an integer. Ritz’s principle, or, as it was 
subsequently called, “the combination principle,” could not be accounted 
for by classical physics. In order to understand the reason for this incom¬ 
patibility, it must be noted that the above-mentioned hypothesis accord¬ 
ing to which the spectrum as a whole was thought to be produced by the 
free vibrations of one single atom had meanwhile been refuted. For Con¬ 
way 32 showed convincingly, in 1907, that each atom could give rise to only 
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one spectral line at a time. Conway’s contention was further elaborated by 
Bevan’s 33 analysis of the anomalous dispersion by potassium vapor. Be- 
van showed that any theoretical explanation of this phenomenon, if 
worked out on the basis of the previous hypothesis, would necessarily 
imply an excessively great number of electrons per molecule. Each indi¬ 
vidual line in a spectrum had therefore to be associated with the periodic 
motion of an electron and the different lines of the spectrum with motions 
of excited electrons in different atoms. Classically, the spectrum of light 
had consequently to contain, together with the fundamental vibration, the 
higher harmonics whose frequencies have the form of a sum of integral 
multiples of the fundamental frequencies, a result inconsistent with Ritz’s 
combination principle. We thus see that the study of the discrete spectra, 
like that of the continuous spectra, led to serious inconsistencies with 
classical physics. 


2.2 Bohr’s Theory of the Hydrogen Atom 

Our preceding remarks on spectroscopy were confined to merely tracing 
the origin and early development of the combination principle. Other 
aspects of spectroscopic research and their influence on the conceptual 
development of quantum theory will be discussed in a different context. 
At present, it should be noted that the vast amount of work invested in the 
search for numerical relations among spectral lines was motivated by the 
hope that these relations, by analogy to certain problems in the theory of 
mechanical and acoustical vibrations, would throw new light upon the 
nature of the proper oscillations of the atom or its electrons and thus lead 
to an understanding of atomic structure and of microscopic processes. 
The combination principle, however, revealed the futility of such an ap¬ 
proach. It gradually became clear that the establishment of merely math¬ 
ematical relations devoid of any consistent theory would be labor in vain. 
The only way to avoid this impasse, it seemed, was to take refuge in a 
model of the atom whose structure could be based on independent evi¬ 
dence from other sources and to apply the mathematical relations to the 
model. The outcome of this conceptual development was, of course, Niels 
Bohr’s synthesis of the combination principle with Rutherford’s atomic 
model on the basis of Planck’s quantum conception. 

In fact, Bohr’s work itself was intimately connected with the problem 
of finding a consistent model of the atom. Even external circumstances of 
his life were connected with it. For when Bohr left J. J. Thomson’s labora¬ 
tory in Cambridge, in March, 1912, to join Rutherford’s team in Man¬ 
chester, it was because of a disagreement with Thomson concerning the 
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latter’s “plum-cake” model of the atom—or, as Condon once phrased it, 
because “J. J. politely indicated that it might be nice if he [Bohr] left 
Cambridge and went to work with Rutherford.” 34 It was a fortunate turn 
of fate, not only with respect to the outcome when eventually Bohr’s 
brilliant and successful theory of the hydrogen atom contributed decided¬ 
ly to the general acceptance of the Rutherford model, but also with respect 
to the whole time of Bohr’s stay in Manchester; for there he found sympa¬ 
thetic understanding and even encouragement from the very beginning of 
his work. More than fifteen years later Rutherford recollected: “On my 
side, the agreement with Planck’s deduction of e early made me an adher¬ 
ent to the general idea of a quantum of action. I was in consequence able to 
view the equanimity and even to encourage Prof. Bohr’s bold application 
of the quantum theory to explain the origin of spectra.” 35 

For a full comprehension of Bohr’s work we shall have to discuss brief¬ 
ly the development of atomic models at that time. 

Nineteenth-century models of the atom, such as those proposed by 
Kelvin, Helmholtz, or Bjerknes, were primarily mechanical or hydro- 
dynamical and were invalidated by the discovery of electrons and radioac¬ 
tive disintegration. One of the earliest models consistent with these dis¬ 
coveries was proposed by Perrin 36 in a popular lecture delivered to 
students and friends of the Sorbonne in 1901. It consisted of a positively 
charged particle surrounded by a number of electrons (“sorte de petites 
planetes”) compensating the central charge. Perrin assumed that the in¬ 
ternal electromagnetic forces could produce a dynamically stable system 
whose rotational periods would correspond to the frequencies or wave¬ 
lengths of the lines in the emission spectrum of the atom. 

Two years later, in December, 1903, J. J. Thomson 37 developed the 
atomic model to which reference was made in the preceding chapter. The 
hydrogen atom, for example, was represented by a positively charged 
sphere of a radius of about 10“ 8 cm with an electron oscillating at its 
center. Thomson’s model had one great advantage: it implied without any 
additional assumptions a quasi-elastic binding of the electrons, a property 
which had been a basic hypothesis for Drude, Voigt, Planck, and Lorentz 
in their work on dispersion, absorption, and other phenomena. In particu¬ 
lar, it was a convenient assumption for the explanation of monochromatic 
spectral lines whose frequencies were independent of the energy of vibra¬ 
tion. 

Thomson’s model, however, failed to account for the large angle de¬ 
flections, up to 150° with reference to the incident direction, in the scatter¬ 
ing experiments with a particles which were carried out a few years later 
by Rutherford’s students, Geiger and Marsden. 38 When Thomson’s ex¬ 
planations of these results on the basis of multiple scattering proved un- 
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tenable, 39 Rutherford’s theory of “single scattering” 40 and his well- 
known model of the atom became firmly established. Thomson’s model 
also failed to explain the subsequently (1913) discovered Stark effect, 
which will be discussed later on. Finally—but this, of course, was not 
known until much later—if the Thomson model is quantized, 41 the result¬ 
ing energy states, although almost identical with those of the Rutherford 
model, turn out to the generally nondegenerate. Thus, even had no Ruth¬ 
erford scattering experiments ever been performed, spectroscopic evi¬ 
dence alone would have decided in favor of the Rutherford model. 

At the time when Thomson developed his atomic model, the Japanese 
physicist Nagaoka, 42 in an address before the Physico-Mathematical So¬ 
ciety of Tokyo in December, 1903, proposed what he called a “Saturnian 
model.” It consisted, like Perrin’s, of a positively charged central particle 
surrounded by a number of equidistant electrons revolving with a com¬ 
mon angular velocity. Nagaoka then tried to show that the lines in the 
emission spectrum have their origin in small transversal oscillations of the 
electron configuration. Referring to the treatment of similar oscillations 
in Maxwell’s essay “On the stability of the motion of Saturn’s rings,” 43 
Nagaoka claimed he had proved the dynamical stability of his model and 
its compatibility with the spectroscopic observations of Runge and Ryd¬ 
berg. The correctness of these computations and, in particular, the stabil¬ 
ity of the model, however, were soon challenged by Schott. 44 

Perrin’s and Nagaoka’s atomic models were further elaborated by the 
Cambridge astrophysicist Nicholson 45 in an attempt to explain the nature 
of a number of unidentified lines in the spectra of nebulae and in the 
spectrum of the solar corona. Associating these lines with the hypotheti¬ 
cal elements “nebulium” and “protofluorine,” Nicholson, closely follow¬ 
ing Nagaoka’s suggestion, constructed atomic models to account for these 
lines. On the assumption of a central charge of four units surrounded by 
four electrons, Nicholson showed that the ratio between the frequencies 
corresponding to two different vibrational modes of the electronic ring 
coincided with the ratio between the frequencies of two lines in the nebu¬ 
lar spectrum. This agreement, in his view, constituted sufficient evidence 
for the adequacy of his model. Nicholson then proceeded to give detailed 
calculations for other lines and to develop a general theory, in the course 
of which he made a number of important discoveries. His theory appeared 
to him to have been given even observational support. For the frequency 
of a third mode (A 4353), which we computed as a sequel to the two modes 
mentioned before and which, at the time of his first publication, was not 
known to correspond to any known spectral line, had actually been ob¬ 
served by W. H. Wright at the Lick Observatory and by M. Wolf at the 
Heidelberg Observatory. 46 Only in 1927 was the nature of the “nebulium” 
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lines definitely clarified when Bowen showed that they are forbidden lines 
of highly ionized oxygen and nitrogen (Nil, Oil, and OIII). 47 

With respect to his model of the atom, Nicholson declared that “the 
main conception involved. . .is that of the nature of positive electricity. 
This is supported to exist in small spherical volume distributions of uni¬ 
form density, whose radius is small in comparison even with that of the 
electron, a reversal of the more generally accepted view. The mass of these 
positive units is very large in comparison with that of an electron, and 
gives rise to nearly the whole mass of an atom. .. .” 48 In his third essay, 
published in June, 1912, Nicholson suggested relating the occurrence of 
spectral series to Planck’s constant of action. “The quantum theory,” he 
stated, “has apparently not been put forward as an explanation of‘series’ 
spectra... .Yet, in the belief of the writer, it furnishes the true explanation 
in certain cases, and we are led to suppose that lines of a series may not 
emanate from the same atom, but from atoms whose internal angular 
momenta have, by radiation or otherwise, run down by various discrete 
amounts from a standard value.” 49 

Nicholson incorporated Planck’s quantum of action into his theory by 
contending that the frequencies of different observed spectral lines can be 
accounted for on the following assumption: that the ratio between the 
energy of the system and the frequency of rotation of the ring of electrons 
is given by an integral multiple of Planck’s constant. Nicholson’s theory, 
based as it was on a correspondence between the frequencies of optical and 
of mechanical vibrations, was, of course, by no means compatible with the 
Ritz combination principle. Nor was his introduction of Planck’s con¬ 
stant motivated by any considerations concerning the stability of his 
atomic model. In fact, with the exception of his casual remark to the effect 
that “the electrons in steady motion must lie in one plane, in order that 
their energy may not be dissipated by rapid radiation,” 50 Nicholson com¬ 
pletely ignored the stability problem. 

It is a curious coincidence in the history of our subject that Nicholson’s 
most important innovations, namely, his idea of a heavy nucleus and his 
conception of spectra as quantal phenomena, were actually obtained at 
the same time by independent research. For it was in May, 1911, that 
Ernest Rutherford refuted Thomson’s hypothesis of multiple scattering 
and established his well-known model of the atom in accordance with the 
large angle deflections of a particles by single encounters. 51 It was also in 
1911 that Bjerrum, following a suggestion by Nemst, 52 made the first 
successful application of the quantum principle to molecular spectra 
when he showed 53 that the quantization of the rotational energy of mole¬ 
cules accounted for certain features in the absorption spectra of gaseous 
hydrochloric and hydrobromic acids. It seems beyond doubt that neither 



74 


Conceptual Development of Quantum Mechanics 


Rutherford nor Bjerrum knew of Nicholson’s ideas and that the latter’s 
conjectures were in no way influenced by the former. It should also be 
noted that Nicholson’s anticipations of some of Bohr’s conclusions were 
based, as Rosenfeld has pointed out, 54 on a most questionable and often 
even fallacious reasoning. 

Bohr’s first research in Manchester on the absorption of a particles by 
matter, 55 which he finished in the summer of 1912, proved highly instruc¬ 
tive for his future work. Treating the problem, which had been taken up in 
terms of classical mechanics by J. J. Thomson as early as 1906, long before 
the discovery of the atomic nucleus, Bohr assumed an elastic binding of 
the atomic electrons. This binding, although entering his calculations 
only through the periods of motions which Bohr inferred from the charac¬ 
teristic resonances as known from optical dispersion, played an essential 
role in limiting the effective region of energy transfer around the path of 
the penetrating particle. Associated now with Rutherford’s atomic model, 
the binding of electrons and the concomitant problem of the stability of 
the atom had to engage Bohr’s attention from the outset of his work. For 
Bohr saw not only the merits of Rutherford’s model but also the difficul¬ 
ties which it raised and, in particular, the fact that, in contrast to Thom¬ 
son’s model, dynamical principles alone could not provide any scale for its 
size. In June, 1912, in the first draft of his classic paper “On the constitu¬ 
tion of atoms and molecules, 56 Bohr wrote: 57 “In the investigation of the 
configuration of the electrons in the atoms we immediately meet with the 
difficulty.. .that a ring, if only the strength of the central charge and the 
number of electrons in the ring are given, can rotate with an infinitely 
great number of different times of rotation, according to the assumed 
different radius of the ring; and there seems to be nothing.. .to allow, from 
mechanical considerations to discriminate between the different radii and 
times of vibration. In the further investigation we shall therefore intro¬ 
duce and make use of a hypothesis, from which we can determine the 
quantities in question. This hypothesis is: that for any stable ring (any ring 
occurring in the natural atoms) there will be a definite ratio between the 
kinetic energy of an electron in the ring and the time of rotation. 58 This 
hypothesis, for which no attempt at a mechanical foundation will be given 
(as it seem hopeless), is chosen as the only one which seems to offer a 
possibility of an explanation of the whole group of experimental results, 
which gather about and seems to confirm conceptions of the mechanisms 
of the radiation as the ones proposed by Planck and Einstein.” This draft, 
composed of six manuscript sheets (one of which is unfortunately lost), 
was written by Bohr as a private communication to Rutherford and is 
naturally an important document for the understanding of the conceptual 
development of Bohr’s atomic theory. Bohr realized that the stability of 
the Rutherford model could by no means be reconciled with the principles 
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of Newton’s mechanics and Maxwell’s electrodynamics. For no system of 
point charges, according to these principles, ever admits of a stable equi¬ 
librium, and any dynamical equilibrium, based on the motions of elec¬ 
trons, must lead to a radiative dissipation of energy accompanied by a 
steady contraction of the system. 

Since in Planck’s discovery a fundamental limitation of classical phys¬ 
ics had already been revealed, Bohr argued that the answer to the stability 
problem had to be sought in Planck’s quantum of action. In contrast to his 
predecessors who related Planck’s h to atomic models with the purpose of 
finding a mechanical or electromagnetic interpretation of h , Bohr recog¬ 
nized that Planck’s constant should be applied to Rutherford’s model not 
in order to elucidate the physical significance of the former, but rather to 
account for the stability of the latter. Bohr realized that to this end the 
theory should yield a constant of the dimension of length, characterizing 
the distance of the electron from the center of the stable orbit. But the only 
constant parameters that appeared in Rutherford’s model of the atom 
were those of masses and charges, and from these no constant of the 
dimension of length could possibly be formed. In the fact, however, that 
the adjunction of h to m and e made it possible to construct an expression, 
such as h 2 /me 2 > which has the dimension of length—even with the re¬ 
quired order of magnitude (~20xl0 -8 cm)—Bohr saw additional evi¬ 
dence for the correctness of his assumption. 

All previous applications of Planck’s constant to atomic models re¬ 
ferred to the Thomson atom and were generally based on the assumption 
of harmonic oscillations for which the rule of quantization could be de¬ 
rived from Planck’s work. The most noteworthy of these attempts was 
Haas’s conjecture, to which reference has been made before. 59 In an at¬ 
tempt to interpret the photoelectric effect as a resonance phenomenon 
between orbital frequencies and the frequencies of the incident radiation, 
Lindemann, 60 in 1911, applied Kepler’s laws to the assumedly elliptical 
orbits of the electrons but otherwise adhered to Thomson’s model. In 
March, 1912, Herzfeld 61 proposed a modification of Thomson’s model by 
assuming circular electronic orbits and a nonuniform charge density of 
the positive sphere and derived from these assumptions the Balmer series 
by a quantization of energy in accordance with a rule formulated by Ha- 
senohrl 62 as a generalization of Planck’s prescription for the quantization 
of the harmonic oscillator. But all these and similar 63 calculations, al¬ 
though leading to an agreement as to the order of magnitude between the 
calculated and observed values for the frequencies and dimensions of the 
atoms, lost their validity with the abandonment of the Thomson model on 
which they were based. Nicholson’s hypotheses, with which Bohr became 
acquainted only toward the end of 1912, as we know from a Christmas 
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card 64 sent by him to his brother Harald, were similarly invalidated, al¬ 
though on other grounds. 

The basic problem which Bohr had to face was the question of how to 
apply Planck’s quantum of action to the Rutherford model of the atom. 
On the assumption that the hydrogen atom consists of an electron revolv¬ 
ing around a nucleus whose charge is equal and opposite to that of the 
electron and whose mass is very large in comparison with the mass m of 
the electron, Bohr first investigated how far classical mechanics could be 
employed. From Kepler’s first law he knew that the orbit of the electron is 
an ellipse with the nucleus in one of its foci. Calling the major axis of the 
orbit 2a, and the energy which has to be added to the system in order to 
remove the electron to an infinite distance from the nucleus W , the charge 
of the electron e, and that of the nucleus e', Bohr could easily show that on 
the basis of classical principles 

co 

v ee’yj m W 

which indicates that the frequency of revolution co and the major axis of 
the orbit depend only on the value of W and are independent of the eccen¬ 
tricity of the orbit. In the special case of a circular orbit with radius a , a 
direct proof of Eq. (2.2) can readily be established. In this case, mv 2 / 
a = ee’/a 2 (centripetal force = force of attraction) shows that the total 
energy U = \mv 2 — ee’/a = — ee'/2a = — W or W = ee’/2a; and from 
v = 2ircoa the first part of Eq. (2.2) follows immediately. Bohr now saw 
that by varying W all possible values for co and 2 a could be obtained. The 
existence of sharp spectral lines, however, shows that the latter quantities 
have to assume definite values characteristic for the system. Moreover, as 
a simple consideration of the dimension verifies, no combination of e , e', 
and m can yield a quantity with the dimension of length, a condition 
necessary for its interpretation as the diameter of the atom. So far, Bohr 
ignored the radiation of energy which according to Maxwell’s theory 
should be dissipated at a rate proportional to the square of the acceleration 
of the electron. If this radiation were taken into consideration, W would 
steadily increase and with it the frequency of revolution co while the di¬ 
mensions of the orbit would continually decrease, producing a continuous 
spectrum instead of discrete spectral lines as observed. Thus the atom 
would be not only indeterminate in its dimensions but also deprived of any 
stability whatever. Realizing the incompatibility of ordinary mechanics 
and electrodynamics with Rutherford’s atomic model and in direct con¬ 
tradiction to Newton’s mechanics and Maxwell’s electrodynamics, Bohr 
boldly postulated the following assumption in order to endow the atom 
with size and stability. There exists a discrete set of permissible or station¬ 
ary orbits; and as long as the electron remains in any stationary orbit, no 
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energy is radiated. It is interesting to note that the second part of this 
assumption, though not explicitly formulated, had been used a few years 
earlier by Langevin 65 in his theory of permanent magnetism. 

Although convinced that the discreteness of the orbits (or of a) and, 
according to Eq. (2.2), also of W and co was somehow connected with 
Planck’s constant of action, Bohr was unable to identify the exact relation¬ 
ship until February, 1913. Two apparently unrelated phenomena gave 
him the clue to the solution of the problem. He had been acquainted from 
his Cambridge days with Whiddington’s experiments. 66 Whiddington 
had shown that when an anticathode was bombarded by cathode rays of 
increasing velocity, sudden changes in the nature of the emitted radiation 
were produced at certain critical velocities. It seems that Whiddington’s 
work suggested to Bohr the idea of energy levels. 67 In fact, Whiddington’s 
results, as, of course, became clear only later on, constituted in the field of 
x-rays one of the most striking verifications of Bohr’s theory, just as the 
subsequent Franck-Hertz experiments did in the range of visible light. 
The other clue came from spectroscopy. Even by the end of January, 
1913, Bohr was not fully aware of the implications of spectroscopic re¬ 
search for his problem. For in a letter dated January 31, 1913, addressed 
to Rutherford from Copenhagen, where Bohr had served since Septem¬ 
ber, 1912, as an assistant to Knudsen, Bohr declared: “I do not at all deal 
with the question of calculation of the frequencies corresponding to the 
lines in the visible spectrum. I have only tried, on the basis of the simple 
hypothesis, which I used from the beginning, to discuss the constitution of 
the atoms and molecules in their ‘permanent’ state.” 68 A few days later, 
however, the spectroscopist H. M. Hansen, who just arrived in Copenha¬ 
gen from Gottingen, where he had been working under Voigt on the in¬ 
verse Zeeman effect of lithium, drew his attention to Rydberg’s work on 
the classification of spectral lines, a subject in which Bohr had not pre¬ 
viously been interested. Asked by Hansen how the new model of the atom 
could account for the regularities discovered by Rydberg and Ritz, Bohr 
acquainted himself with the subject and soon recognized its importance 
for his problem. “As soon as I saw Balmer’s formula,” he stated repeated¬ 
ly, 69 “the whole thing was immediately clear to me.” 

How, indeed, Balmer’s formula suggested to Bohr a way of incorporat¬ 
ing quantum conceptions into the Rutherford model of the hydrogen 
atoms will now be explained. For this purpose we shall closely analyze 
Bohr’s exposition of this issue as presented in his epoch-making paper. 70 
Balmer’s formula and its immediate generalization can most conveniently 
be expressed, as we have seen in connection with the work of Ritz and 
Rydberg, by the equation 71 
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Bohr derived this formula in the first part 72 of the above-mentioned 
paper by three different methods. In his first derivation 73 he adopted the 
basic assumption of Planck’s “second theory,” 74 according to which the 
amount of energy emitted by an atomic vibrator of frequency v is rhv, r 
being an integer, namely, the number of quanta emitted, each of frequency 
v. Considering an electron at a great distance apart from the nucleus and 
“of no sensible velocity relative to the latter,” Bohr studied the binding of 
this electron with the nucleus as a result of which the electron “has settled 
down in a stationary (circular) orbit around the nucleus” with orbital 
frequency co. During this process, Bohr contended, a “homogeneous” 
(monochromatic) radiation is emitted of a frequency v, equal to half the 
frequency of revolution of the electron in its final orbit—this relation 
suggesting itself in view of the fact that “the frequency of revolution of the 
electron at the beginning of the emission is 0.” Thus, from 
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Bohr obtained by Eq. (2.2) 
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expressions which for r = 1, 
and h = 6.5 X 10 -27 yield 
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= |e'| = 4.7 X 10“ 10 , e/m = 5.31X 10 17 , 
W= 13 eV, co = 6.2X 10 15 sec“\ and 
2 a = 1.1X 10“ 8 cm in agreement with the empirical values of the ioniza¬ 
tion potential, the optical frequencies, and the linear dimensions of the 
hydrogen atom. If, as he found, the energy emitted by the formation of one 
of the stationary states is for the hydrogen atom 
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the “energy emitted by the passing of the system from a state correspond¬ 
ing to r = Tj to one corresponding to r = r 2 ” turns out to be 
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Hence, since 
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which is Balmer’s formula. 

In his second derivation 75 of the Balmer formula Bohr noted that it had 
not been necessary to assume that “a radiation is sent out corresponding 
to more than a single energy quantum /zv.” Since “as soon as one quantum 
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is sent out the frequency is altered,” it would be more consistent, he de¬ 
clared, to replace the previously assumed emission of r quanta of frequen¬ 
cy v/2 by that of one quantum of frequency rv/2. Bohr, in fact, general¬ 
ized this relationship still further by putting the ratio between the emitted 
energy and the frequency of revolution of the electron for the different 
stationary states as 

W=f(r)hcD 

where/ ( r ) is an as yet undetermined function of the integer r. This he did 
because the particular choice / (r) =r/2, as used in his first derivation, 
though plausible, did not seem to him to have sufficient logical cogency. 
Repeating now step by step the calculation of the first derivation, but with 
r/2 replaced by/(r), Bohr obtained 
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Whereas in his first derivation the particular specification/ (r) =r/2 led 
him directly to the Balmer formula, he had now to pay for the greater 
generality of his assumption with an additional step: he had to take re¬ 
course to the structure of the Balmer formula itself in which the variable 
factor has the form 1 /t 2 2 — 1/r 2 . To obtain such a factor he concluded 
that/ (r) = cr. Here c is a constant which has to be determined. For this 
purpose he considered the transition of the system between two successive 
stationary states corresponding to r = Aand r = N — 1. By a simple cal¬ 
culation he obtained for the frequency of the radiation emitted 

_ 'n 2 me 2 e' 2 2N— 1 
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and for the frequencies of revolution of the electron before and after the 
emission 
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“If Nis great,” Bohr continued, “the ratio between the frequency before 
and after the emission will be very nearly equal to 1; and according to 
ordinary electrodynamics we should therefore expect that the ratio be¬ 
tween the frequency of radiation and the frequency of revolution also is 
very nearly equal to 1. This condition will only be satisfied if c = *.” In 
fact, since 
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tends to 2c for large N t it approaches 1 only if c = Thus the result of the 
first derivation is retrieved. 
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In his third derivation 76 he dispensed altogether with Planck’s relation 
by replacing it by what he called “an interpretation of the emission by 
analogy with ordinary electrodynamics.” For, he declared, “an electron 
rotating round a nucleus in an elliptical orbit will emit a radiation which 
according to Fourier’s theorem can be resolved into homogeneous com¬ 
ponents, the frequencies of which are no, if o is the frequency of revolu¬ 
tion of the electron.” Taking consequently the frequency of the energy 
emitted during the transition from a state in which no energy is yet emit¬ 
ted to another stationary state as equal to a multiple of o/2, where o is the 
frequency of revolution of the electron in the state considered, Bohr ar¬ 
rived at the same expression as before. “Consequently,” he declared in the 
conclusion of his present discussion, “we may regard our preliminary 
considerations. . .only as a simple form of representing the results of the 
theory.” 77 

Bohr’s approach, as we saw, was essentially an application of what he 
later termed the “correspondence principle,” according to which, in the 
limit where the action involved is sufficiently large to permit the neglect of 
the individual quanta, the fundamentally statistical account of quantum 
phenomena can be represented as a rational generalization of the classical 
physical description. Bohr must have been aware that, by postulating the 
quantization of the angular momentum, he could have presented this the¬ 
ory in a mathematically more compact way. But distrusting the legiti¬ 
macy of employing the conceptions of the “old mechanics” and avoiding 
their use whenever possible, Bohr rejected this alternative and gave 
greater credence to the application of the correspondence principle. In 
fact, when he wrote the first part of his classic paper, the quantization of 
the angular momentum was for him merely an “interpretation” in terms 
of “symbols taken from ordinary mechanics.” For he declared: “While 
there obviously can be no question of a mechanical foundation of the 
calculations given in this paper, it is, however, possible to give a very 
simple interpretation of the results of the calculations by help of symbols 
taken from ordinary mechanics. Denoting the angular momentum of the 
electron round the nucleus by M, we have immediately for a circular orbit 
irM = T /o, where o is the frequency of revolution and T the kinetic 
energy of the electron; for a circular orbit we further have T = W, and 
from W — rhco/2 we consequently get M — tM 0 , where M 0 = h / 
2 tt = 1.04X 10 -27 ... .The angular momentum of the electron round the 
nucleus in a stationary state of the system is equal to an entire multiple of a 
universal value, independent of the charge on the nucleus.” 78 Clearly, in 
this case the quantization of the angular momentum and the formula 
W = \rho are mathematically equivalent statements. In the concluding 
remarks of his paper 79 Bohr summarized all his assumptions employed so 
far in the following formulation: “ (1) That energy radiation is not emitted 
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(or absorbed) in the continuous way assumed in the ordinary electrody¬ 
namics, but only during the passing of the systems between different ‘sta¬ 
tionary’ states. (2) That the dynamical equilibrium of the systems in the 
stationary states is governed by the ordinary laws of mechanics, while 
these laws do not hold for the passing of the systems between the different 
stationary states. (3) That the radiation emitted during the transition of a 
system between two stationary states is homogeneous, and that the rela¬ 
tion between the frequency v and the total amount of energy emitted E is 
given by E = hv , where h is Planck’s constant. (4) That the different 
stationary states of a simple system consisting of an electron rotating 
round a positive nucleus are determined by the condition that the ratio 
between the total energy, emitted during the formation of the configura¬ 
tion, and the frequency of revolution of the electron is an entire multiple of 
\h. Assuming that the orbit of the electron is circular, this assumption is 
equivalent with the assumption that the angular momentum of the elec¬ 
tron round the nucleus is equal to an entire multiple of h /2i r. (5) That the 
‘permanent’ state of any atomic system, i.e., the state in which the energy 
emitted is maximum, is determined by the condition that the angular 
momentum of every electron round the centre of its orbit is equal to 
h /2ir ” 

It should be noted in this context that Ehrenfest, 80 a short time prior to 
Bohr, had already applied the quantization of angular momentum with¬ 
out, however, formulating it as a general principle. This he did in his 
refinement of the Einstein-Stern theory (see ref. 104 of chap. 1 ) of the 
specific heat of diatomic gases when he showed that the assumption 
\L ( 2irv) 2 = nhv/2 (L is the moment of inertia, v the frequency, and n an 
integer) accounted for the empirical temperature dependence of the rota¬ 
tional energy of hydrogen without the need of introducing a zero-point 
energy. 

In addition to the close agreement between the calculated and the ob¬ 
served values of the Rydberg constant, Bohr found support for the cor¬ 
rectness of his theory in the fact that Eq. (2.3) applied not only to the 
Balmer series with r 2 = 2 but also for r 2 = 3 (and consequently r l = 4, 
5,...). The case r 2 = 3 corresponded to a series in the infrared which, after 
having already been predicted by Ritz, 81 was actually observed in 1908 by 
Paschen 82 and which was later called the “Paschen series.” Moreover, 
Bohr’s prediction that “if we put r 2 = 1 and r 2 = 4,5,. . ., we get series 
respectively in the extreme ultraviolet and extreme ultrared, which are not 
observed, but the existence of which may be expected” 83 was soon veri¬ 
fied. The series for r 2 = 1 was observed by Lyman 84 in 1914, that for 
r 2 — 4 by Brackett 85 in 1922, and that for t 2 = 5 by Pfund 86 in 1924. 

In 1913 it could be rightfully argued that the fact that not all lines 
predicted by Eq. (2.3) had actually been observed was an indication of the 
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deficiency of experimental procedures rather than a disproof of Bohr’s 
theory. The existence, however, of a single hydrogen line at variance with 
Eq. (2.3) would have shown that Bohr’s theory was at least incomplete, if 
not altogether faulty. It was therefore a challenge to Bohr when his atten¬ 
tion was drawn to a paper, published in 1896, in which the American 
astronomer Pickering claimed he had found hydrogen lines not accounted 
for by the ordinary Balmer formula in the spectrum of the star f Puppis. 
Nor did these lines, as it then became clear, agree with Bohr’s more gen¬ 
eral formula (2.3). Pickering claimed having observed “a series of lines 
whose approximate wave-lengths are 3814, 3857, 3923, 4028, 4203, and 
4505, the last line being very faint. These six lines,” he argued, “form a 
rhythmical series like that of hydrogen and apparently are due to some 
element not yet found in other stars or on the Earth. The formula of 
Balmer will not represent this series, but if we add a constant term and 
write A = 4650[m 2 /(m 2 — 4) ] — 1032, we obtain for m equal to 10, 9, 8, 
7,6 and 5 the wave-lengths 3812,3858,3928,4031,4199 and 4504.” 87 One 
year later Pickering identified these lines as hydrogen lines for the follow¬ 
ing reason: “A remarkable relation exists between these two series, from 
which it appears that the second series, instead of being due to some 
unknown element as was at first supposed, is so closely allied to the hydro¬ 
gen series, that it is probably due to that substance under conditions of 
temperature or pressure as yet unknown. The wave-length of the lines of 
hydrogen may be computed by the formula 3646.1 [n 2 /(n 2 — 16)] which 
is the formula of Balmer, slightly modifying the constant term so that the 
standard wave-length of Rowland shall be represented, and substituting 
\n for m. The wave-length of the lines of hydrogen may be determined by 
this formula if we substitute for n the even integers 6, 8, 10, 12 etc... .If 
now we substitute for n the odd integers 5,7,9,11 etc., we obtain the wave¬ 
lengths of the second series of lines in the spectrum of f Puppis.” 88 Thus, 
one single formula, namely, A = 3646 [n 2 /(n 2 — 16)], applied both to the 
Balmer lines (for even n) and to the newly discovered lines (for odd n). 
Kayser 89 at first expressed some doubts about Pickering’s identification of 
these lines but later 90 accepted the theory on the grounds that hydrogen 
would have two series, which “then corresponds admirably with the other 
elements.” Kayser also pointed out: “That this series has never been ob¬ 
served before, can perhaps be explained by not sufficient temperatures in 
our Geissler tubes and most of the stars.” When in July, 1897, the line 
A 5413.9, corresponding to n = 7, also was observed as “clearly visible” in 
the spectrum of f Puppis, 91 Pickering’s contentions were generally ac¬ 
cepted. 

Rydberg, 92 in accordance with his general classification of series, inter¬ 
preted Balmer’s series as the diffuse (or first subordinate) and Pickering’s 
as the sharp (or second subordinate) series of hydrogen, so that the for- 
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mer corresponded ton = n 0 — N^/im + l) 2 form = 2,3,- • .and the latter 
to n = n 0 — No/(m + 0.5) 2 . Now, in accordance with the Rydberg- 
Schuster law a “principal series” of hydrogen should exist whose wave 
numbers are given by 


(1+0.5) 2 (m+1) 2 

with m = 1,2,..., and whose first 93 member consequently is A 4687.88. In 
fact, a line k 4686 was actually observed 94 in the spectrum of f Puppis and 
also, during the Indian solar eclipse of January 22, 1898, in the spectrum 
of the sun’s chromosphere. 95 Furthermore, in 1912 Fowler observed in 
the spectrum of a Plucker discharge tube containing a mixture of hydro¬ 
gen and helium four lines at positions corresponding to Rydberg’s so- 
called “principal series of hydrogen” and a new series of lines in the ultra¬ 
violet with the same high-frequency limit. 96 

These were the facts which Bohr then had to explain on the basis of his 
theory. At first he showed that the Pickering series and the hypothetical 
“principal series of hydrogen,” originally expressed by 

v = cr(\ -- —and v = cr (—- 

\2 2 ( m + \) 2 J \ 1.5 2 m 2 ) 

with m = 2, 3,.. .respectively, could be rewritten as 


with k = 5, 7,..., and 


v = 4 cR 



v = 4 cR 




with k = 4, 6,.. .respectively, that is, as series whose Rydberg constant is 
four times that of hydrogen. Bohr could now easily show that this theory, 
if applied to the ionized helium atom with nuclear charge 2e , leads exactly 
to these formulas. 

In a letter to Rutherford, dated March 6, 1913, Bohr referred to these 
ideas and wrote that “the chemist Dr. Bjerrum suggested to me that if my 
point was right the lines might also appear in a tube filled with a mixture of 
helium and chlorine (oxygen, or other electro-negative substances); in¬ 
deed it was suggested, that the lines might be still stronger in this case. 
Now, we have not in Copenhagen the opportunity to do such an experi¬ 
ment satisfactorily; I might therefore ask you, if you possibly would let it 
perform in your laboratory, or if you perhaps kindly would forward the 
suggestion to Mr. Fowler, which may have the arrangement still stand¬ 
ing.” 97 In fact, when Evans, 98 following this suggestion, published a short 
note in Nature in which he indicated that his experiments did support 
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Bohr’s claim that helium, and not hydrogen, is the origin of the lines in 
question, Fowler" argued against Evans that the observed wavelengths of 
these lines differ, though only by a small amount, from their theoretical 
values. Bohr, 100 however, soon pointed out that if the finite ratio between 
the mass m of the electron and the mass m„ of the nucleus is taken into 
consideration, and if therefore the value of m in his formula for the Ryd¬ 
berg constant R is replaced by that of the “reduced mass” 


1 4 - m/m n 

the observed discrepancies of about 0.04 percent are fully accounted for. 
When subsequent experiments carried out by Evans 101 at the University 
of Manchester, by Fowler, 102 and by Paschen 103 corroborated this conten¬ 
tion, Fowler’s original objection became another striking confirmation of 
Bohr’s theory. 

It will also be recalled that the dependence of R on m n , as contended by 
Bohr, led Urey 104 and his collaborators in 1932 to the discovery of heavy 

* o 

hydrogen or deuterium H with an isotope shift of 1.79 A for the first 
Balmer line H a and 1.32 A for H p . 

The most direct confirmation of Bohr’s interpretation of spectral terms 
as stationary energy levels and of his frequency condition (assumptions 1 
and 3 of his summary) was afforded by a series of experiments performed 
by Franck and Hertz. 105 Electrons from a thermoionic source were accel¬ 
erated to a known energy and directed against atoms of a gas or vapor at 
low pressure. At low electronic energies only elastic collisions occurred 
and no radiation was observed. As soon, however, as the electron’s energy 
was equal to or exceeded a critical value (4.9 eV in the case of mercury 
vapor), inelastic collisions took place and radiation (A 2537, the mercury 
resonance line) was observed. The energy loss of the electron corre¬ 
sponded to the energy difference between the ground state and the excited 
state of the atom which, by transition to the ground state re-emits this 
energy in the form of light in accordance with Bohr’s frequency condition. 

Our analysis of Bohr’s contribution of 1913 would not do justice to his 
work if it confined itself to an exposition of the physical contents without 
due consideration of its philosophical foundations. First let us note that 
Bohr’s theory was, in general, very favorably accepted. When Einstein, a 
severe censor as far as physical theoretizing was concerned, was informed 
in September, 1913, by Hevesy in Vienna that Evan’s experiments had 
confirmed Bohr’s ascription of the Pickering lines to helium, he described 
Bohr’s theory as an “enormous achievement” and as “one of the greatest 
discoveries.” In fact, Einstein recognized the importance of Bohr’s theory 
as soon as he became acquainted with it. This happened when, in one of 
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the weekly physics colloquia held in Zurich conjointly by the University 
and the Institute of Technology, a report on the first part of Bohr’s 1913 
paper was given shortly after its publication. At the end of the discussion 
von Laue protested: “This is all nonsense! Maxwell’s equations are valid 
under all circumstances.” 106 Whereupon Einstein rose and declared: 
“Very remarkable! There must be something behind it. I do not believe 
that the derivation of the absolute value of the Rydberg constant is purely 
fortuitous.” 107 

Also in September, 1913, at the 83d meeting of the British Association 
for the Advancement of Science, Jeans, in “the most important discussion 
of Section A, if not of the whole meeting,” 108 gave a summarizing report 
on the subject of radiation. 109 Stating that “any discussion of the nature of 
radiation is of necessity inextricably involved in the larger question as to 
the ultimate form of the laws which govern the smallest processes of na¬ 
ture” and acknowledging the need for “a very extensive revision” of the 
laws which so far have been believed to be expressible in the form of 
differential equations, he spoke highly of Bohr’s new theory. Although 
fully aware that this theory contains difficulties, still unsurmounted, 
“which appear to be enormous,” such as the difficulty of explaining the 
Zeeman effect, Jeans regarded “the series of results obtained [by Bohr as] 
far too striking to be dismissed merely as accidental.” He called Bohr’s 
theory “a most ingenious and suggestive, and I think we must add con¬ 
vincing, explanation of the laws of spectral series.” 110 The favorable ac¬ 
ceptance of the new ideas was advanced by extremely good coverage of 
this meeting by the press and, in particular, by the report published in the 
Saturday issue of The Times 111 with its reference to “Dr. Bohr’s ingenious 
explanation of the hydrogen spectrum,” the article in Nature 112 which 
called Bohr’s theory “convincing and brilliant,” as well as Norman 
Campbell’s resume on the structure of the atom, also published in Na- 
ture , 113 which described Bohr’s assumptions as “simple, plausible and 
easily amenable to mathematical treatment” and as leading to results “in 
exact quantitative agreement with observation.” 

Bohr himself, however, regarded his theory as merely a “preliminary 
and hypothetical” way of representing a number of experimental facts 
which defied all explanation on the basis of ordinary electrodynamics and 
classical mechanics. In an address delivered before the Physical Society in 
Copenhagen on December 20,1913, Bohr 114 declared as his present objec¬ 
tive not “to propose an explanation of the spectral laws” but rather “to 
indicate a way in which it appears possible to bring the spectral laws into 
close connection with other properties of the elements, which appear to be 
equally inexplicable on the basis of the present state of the science.” 115 In 
fact, his intention was not to give a satisfactory answer to a definite ques¬ 
tion but to search for the right question to ask. For it was an established 
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fact for him that ordinary electrodynamics and classical mechanics are 
inadequate to account for the stability of Rutherford’s atomic model and 
that it is impossible “to obtain a satisfactory explanation of the experi¬ 
ments on temperature radiation with the aid of electrodynamics, no mat¬ 
ter what atomic model be employed. The fact that the deficiencies of the 
atomic model we are considering stand out so plainly is therefore perhaps 
no serious drawback; even though the defects of other atomic models are 
much better concealed, they must nevertheless be present and will be just 
as serious.” 116 

Not only did Bohr fully recognize the profound chasm in the concep¬ 
tual scheme of his theory, but he was convinced that progress in quantum 
theory could not be obtained unless the antithesis between quantum- 
theoretic and classical conceptions was brought to the forefront of theo¬ 
retical analysis. He therefore attempted to trace the roots of this antithesis 
as deeply as he could. It was in this search for fundamentals that he intro¬ 
duced the revolutionary conception of “stationary” states, “indicating 
thereby that they form some kind of waiting places between which occurs 
the emission of the energy corresponding to the various spectral lines.” 117 
For he clearly realized that Planck’s idea of a discontinuity in energy 
emission—the very idea which probably suggested to him the notion of 
“stationary states” as “waiting places” (“Holdepladser”)—was foreign 
to classical physics, irreconcilable with it, but nevertheless legitimate and 
necessary. It was in this search for fundamentals that he generalized the 
derivation of Rydberg’s constant from its first version, based on Planck’s 
formula, to its last formulation, based on “a connection with the ordinary 
conceptions.” 118 This analogy, however, he stressed time and again, 
should not be misleadingly interpreted as an analogy in the foundations. 
For “nothing has been said here about how and why the radiation is 
emitted.” 119 Concluding his address, Bohr declared: “I hope I have ex¬ 
pressed myself sufficiently clearly so that you appreciate the extent to 
which these considerations conflict with the admirably coherent group of 
conceptions which have been rightly termed the classical theory of elec¬ 
trodynamics. On the other hand, by emphasizing this conflict, I have tried 
to convey to you the impression that it may also be possible in the course of 
time to discover a certain coherence in the new ideas.” 120 

We thus see that, contrary to Planck and Einstein, Bohr did not try to 
bridge the abyss between classical and quantum physics, but from the very 
beginning of his work, searched for a scheme of quantum conceptions 
which would form a system just as coherent, on the one side of the abyss, 
as that of the classical notions on the other side of the abyss. 121 
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CHAPTER 3 


The Older Quantum Theory 


3.1 Quantum Conditions and the Adiabatic Principle 

Although the declared aim of Bohr’s classic paper of 1913, as its title 
implied, was the development of a general theory on the constitution of 
atoms and molecules, Bohr’s theory afforded a rigorous and adequate 
explanation only of hydrogen and hydrogen-like atoms, and all his imme¬ 
diate efforts at extending the theory to systems with more than one elec¬ 
tron failed. Even at that time the spectrum of neutral helium with its two 
electrons could not be accounted for, nor was it possible to find a reason¬ 
able arrangement of the electron orbits from which the first ionization 
potential could be deduced in agreement with observation. 

It will be recalled that Bohr referred at the beginning of his work to 
elliptic orbits of the electrons but later dealt almost exclusively with the 
special case of circular motion. He also confined his discussion to the 
nonrelativistic case assuming that the velocity of the electron is small 
compared with that of light. However, in 1914 when Curtis 1 found a small 
systematic disagreement between Bohr’s theoretical values of the wave¬ 
lengths of the hydrogen lines and their observed values, Bohr 2 introduced 
the relativistic variability of mass and derived the formula 

v= ^£^(j-_j_)[ 1+ ^;(j_ + j_)1 (3 „ 

h 3 (m+M) \n 2 n 2 ) L c 2 h 2 \n 2 n 2 2 ).I 

where e and m are the charge and the mass of the electron, M the mass of 
the nucleus, and n l , n 2 positive integers and where terms involving higher 
powers than the second of the ratio between the velocity of the electron 
and the velocity of light are neglected. The correction term depending on 
the sum of the inverse squares of the integers had the same sign as the 
deviation from the Balmer law found by Curtis but accounted for only 
one-third of the deviations observed. 

The main problem which physics had to face at that time, however, was 
the question of how to extend the theory to more complex systems. In fact, 
as early as 1911 Poincare had raised this question during a discussion at 
the Solvay Congress 3 when he asked how Planck’s treatment of the har- 
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monic oscillator and its quantum condition, according to which the ele¬ 
mentary region of the a priori probability in the pq plane was determined 
by the equation SSdp dq = h , should be extended to systems with more 
than one degree of freedom. Planck, in reply, expressed his conviction that 
the formulation of a quantum theory for systems with more than one 
degree of freedom would soon be possible. 4 Four years later the answer 
was found primarily by Planck and by Sommerfeld. 

Planck 5 solved the problem for dynamic systems of/ degrees of free¬ 
dom whose equations of motion admit / regular integrals. Generalizing 
his treatment of the harmonic oscillator, he divided the phase space by 
means of the surfaces F(p kt q k ) = const, defined by the integrals, into 
volume h f and postulated that the stationary states correspond to the /- 
dimensional intersections of these surfaces. It was later shown by Epstein 6 
and by Kneser 7 that Planck’s conditions for the characterization of the 
stationary states are equivalent to the quantum conditions as formulated 
by Sommerfeld and obtained by a completely different approach. 

For a full comprehension of Sommerfeld’s approach the conceptual 
situation of the quantum theory in the years 1913 to 1915 has to be re¬ 
viewed. The outstanding success of Bohr’s theory as far as hydrogen 
atoms are concerned suggested that it would be advisable to study not only 
the content of the theory but also its methodological approach. This 
study, now, made it increasingly clear that the quantum-theoretic treat¬ 
ment of a dynamical system consisted of three parts: first the application 
of classical mechanics for the determination of the possible motions of the 
system; second, the imposition of certain quantum conditions for the se¬ 
lection of the actual or allowed motions; and, third, the treatment of the 
radiative processes as transitions between allowed motions subject to the 
Bohr frequency formula. It was hoped that by applying this scheme of 
ideas to systems of arbitrarily many degrees of freedom, a systematic 
method of bringing abstract order to the increasing wealth of experimen¬ 
tal results could be obtained. The detailed elaboration of this peculiar 
synthesis of classical and quantal conceptions, usually referred to as “the 
older quantum theory,” led eventually to the establishment of two general 
principles which may be regarded as the foundations of the theory: the 
adiabatic principle and the correspondence principle. 

The first major achievement of the older quantum theory was Arnold 
Sommerfeld’s generalization of Bohr’s theory of the hydrogen atom. 8 As 
early as 1891 Michelson 9 had discovered that the Balmer series was not 
composed of truly single lines. This discovery, incompatible, of course, 
with Bohr’s theory, was either ignored or not regarded as a weighty argu¬ 
ment against Bohr’s theory in view of the small order of magnitude in¬ 
volved. Sommerfeld, however, suspected that Bohr’s analysis of the hy- 
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drogen atom was only approximately correct, based as it was on only one 
quantum condition, the quantization of the angular momentum. Som- 
merfeld therefore hoped that a generalization to two degrees of freedom, 
corresponding to the two-dimensional motion of the electron in its orbital 
plane, would lead to a fuller agreement with experience and at the same 
time indicate how systems with more than one degree of freedom have to 
be treated. 

Recognizing that the quantization of the angular momentum could be 
expressed, in terms of plane coordinates, by the requirement that 
dq> = n^h, whereis the (angular) momentum corresponding to the 
azimuthal angle <p and where the integration is extended over the period of 
q >, Sommerfeld postulated that the stationary states of a periodic system 
with/ degrees of freedom are determined by the conditions that “the phase 
integral for every coordinate is an integral multiple of the quantum of 
action” 10 or that for k = 1,2,...,/ 



where p k is the momentum corresponding to the coordinate q ki n k is a 
non-negative integer, and the integration is extended over a period of q k . 

These “ Sommerfeld conditions,” as the quantum conditions (3.2) 
were subsequently called, were also obtained—even a few weeks before 
their establishment by Sommerfeld—by Wilson 11 and, independently, by 
Ishiwara 12 in their attempts to develop a general theory to cover in a 
unified way Planck’s treatment of the harmonic oscillator and Bohr’s 
discussion of the hydrogen atom. It was, of course, immediately obvious 
that the generalized quantum conditions (3.2) comprise in a single for¬ 
mula the correct quantization for these dynamic systems. 13 

Wilson considered a multiply periodic system whose kinetic energy is a 
linear function of the squares of the generalized velocities q k . Consequent¬ 
ly, 2L = J t q k ( dL /dq k ) = 2 2 L ki where 2 L k = q k p k . Hence 2 f L k 
dt = S Pk dq kf where the integration is extended over the period 1/v*. 
corresponding to q k . Wilson now formulated his quantum conditions as 
follows: “The discontinuous energy exchanges always occur in such a way 
that the steady motions satisfy the equations: / p k dq k = n k h f where n k 
are positive integers (including zero) and the integrations are extended 
over the values p k and q k corresponding to the period 1/v*..” 14 In addition 
Wilson postulated the two following hypotheses: “(1) Interchanges of 
energy between dynamical systems and the aether, or between one dy¬ 
namical system and another are ‘catastrophic’ or discontinuous... 
(2) The motion of a system in the intervals between such discontinuous 



96 


Conceptual Development of Quantum Mechanics 


energy exchanges is determined by Hamiltonian dynamics as applied to 
conservative systems.” Ishiwara formulated his quantum conditions as 
follows: “In nature, motions always occur in such a way that every phase 
plane /? x ^ x can be divided in those elementary regions of equal probability 
whose average value in a definite point of the phase space 

h = ( l/j) £ I q t dp, is equal to a universal constant.” In contrast to 

/ = i J 

Wilson and Ishiwara, who never applied their theory to the calculation of 
atomic spectra, Sommerfeld used his conditions for a relativistic treat¬ 
ment of the hydrogen atom. Resulting in the famous theory of the fine 
structure of hydrogen, Sommerfeld’s discovery, in full agreement with 
observation, made a great impression at the time and overshadowed the 
work of the others. 

According to classical mechanics the motion of the electron in the 
hydrogen atom is an elliptic orbit in a fixed plane. “ Guided by the experi¬ 
ence of the Balmer series” 15 as well as by Debye’s extension 16 of Planck’s 
treatment of the harmonic oscillator to arbitrary periodic motions of one 
degree of freedom, Sommerfeld regarded the Kepler motion as a two- 
dimensional problem. Introducing plane polar coordinates, the radius 
vector r and the azimuthal angle xp in the plane of the orbit, he postulated 
that not only ^ but also r has to be subjected to quantum conditions. 17 
From the radial and azimuthal quantum conditions 



dr = rih 



(3.3) 


where n! is the radial quantum number and k the azimuthal quantum 
number, he deduced that k/(k + ri) = b/a, where b is the minor and a 
the major semiaxis of the ellipse, and showed that the energy of the corre¬ 
sponding stationary orbit is given by the equation E = — Rhc/(k + n') 1 
or, for hydrogen-like atoms with Z> 1, by E= — RhcZ 2 /(k + n') 2 . 
Since the case k = 0, corresponding to linear orbits through the nucleus at 
the origin, had to be excluded, the set of values for k + n' coincided with 
the set of values for r in Bohr’s formula E = — Rhc/r 1 for circular orbits. 
Thus in spite of the greater multiplicity of orbital shapes, Sommerfeld’s 
theory so far did not lead to any additional energy levels. 

Disappointed by this result, Sommerfeld treated the same problem as a 
system of three degrees of freedom. For this purpose he introduced the 
polar coordinates r, 6, and q> with the nucleus as origin, where r as before 
denoted the radius vector, 6 the latitude (or more precisely the colati¬ 
tude) with reference to a given polar axis, and q> the azimuth. From the 
quantum conditions §p r dr=rih> fp^dq) = n x h, and §p e dO = n 2 h, 
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where ri,n Xi and n 2 are the radial, equatorial, and latitudinal quantum 
numbers respectively, and from the comparison of the expression for the 
kinetic energy in terms of r, 6 , <p with its expression in terms of r and tp, he 
found that k = n x + n 2 . Since the total angular momentum = kh /2ir 
is normal to the orbital plane and its projection upon the polar axis is 18 /^, 
Sommerfeld concluded that n x =/ccosa or cos a = n x /(n x +n 2 ), 
where a is the angle between the direction of p^ and the polar axis. This 
equation indicated the existence of discrete inclinations of the orbital 
plane with respect to the polar axis, a relationship which was referred to as 
“space quantization” and which was physically significant whenever the 
polar axis was uniquely determined, as, for instance, by the direction of a 
magnetic field. 

Yet even this treatment of the hydrogen atom as a system of three 
degrees of freedom did not increase the number of energy levels. The 
reason, as we shall see presently, was the two-fold degeneracy of the sys¬ 
tem which Sommerfeld treated as a system of three degrees of freedom but 
which had only one degree of periodicity. 

Finally, in the second part of his paper 19 Sommerfeld treated the prob¬ 
lem relativistically. He showed that as in the case of every periodic motion 
under the influence of a central force, the electron with rest mass m de¬ 
scribes a rosette or, more precisely, an ellipse with a slowly precessing 
perihelion and with one of its foci at the nucleus. Originally Sommerfeld 
solved the problem by a relativistic generalization of the nonrelativistic 
treatment of the Kepler motion. Later 20 he realized that the “royal road” 
for solving the problem is the method of separating the variables in the 
corresponding Hamilton-Jacobi differential equation. As shown by the 
theory of relativity, the Hamiltonian, which in this case is also the total 
energy E, is given by the expression 

E = mc 2 (y~ l — 1) — 

r 


where y = (1 — v 2 /c 2 ) 1/2 , v is the velocity of the electron, and where plane 
polar coordinates r and tp are used as in (3.3). Since p 2 + p^/r 2 = y~ 2 
m 2 v 2 f the Hamilton-Jacobi equation reads 



= 2mE + 


2mZe 2 
r 



?)■• 


It can be separated in the coordinates r and tp and only in these. The 
coordinate ip being cyclic (ignorable),/^ is a constant and the azimuthal 
quantum condition yields 2irp^ — kh. On the other hand, the radial quan- 
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turn condition, in which, because of the advance of the perihelion, the 
range of integration extends from ^ = 0 to if> = 2ir/^l — Z 2 /p^ 2 > leads to 
the equation — 2m(y[C — B /^A ) = n'h, where C = — p^ + ZV/c 2 , 
B = mZe 2 + Ze 2 E /c^and A = 2mE + E 2 /c 2 . Introducing the fine-struc¬ 
ture constant a = 2ire L /he and eliminating Sommerfeld obtained for 
the total energy E = E nk the expression 


E nk = mc 2 ^ 1 + a 2 Z 2 (n — k + V k 2 — a 2 Z 2 ) 2 ] 1/2 — 1} 


where n = ri + k. Expanded in powers of a up to a 2 



— Z 2 Rhc 






(3.4) 


which for circular orbits (n = k) and Z = 1 agrees with Bohr’s result 
(3.1). 

The relativistic correction term, that is, the term containing the factor 
a 2 , is a function of n and k and the energy levels are therefore indeed 
multiplets. Sommerfeld obtained the result he expected: a theoretical ac¬ 
count of the fine structure of hydrogen lines. As formula (3.4) shows, it 
should have been easier, owing to the factor Z 4 , to observe the fine struc¬ 
ture in the spectrum of ionized helium than in that of hydrogen. In fact, 
Paschen’s precision measurements of the helium spectrum 21 appeared to 
be in perfect quantitative agreement with Sommerfeld’s prediction, and 
Michelson’s early discovery found its explanation. Incidentally, Pas¬ 
chen’s confirmation of Sommerfeld’s theory served also as an indirect 
confirmation of Einstein’s relativistic formula for the velocity dependence 
of inertial mass. For Glitscher, 22 following a suggestion by W. Lenz, 
showed that the alternative mass variation formula, 23 proposed by Abra¬ 
ham in accordance with his theory of the “rigid electron,” would have 
implied a splitting of the helium lines radically different from that actually 
observed by Paschen. 

Sommerfeld’s theory of the fine structure soon became the subject of 
extensive studies and detailed elaborations. In his dissertation 24 entitled 
“Intensities of Spectral Lines” Kramers calculated the relative intensities 
of the fine-structure components on the basis of Bohr’s correspondence 
principle (which will be discussed in Sec. 3.2) and obtained results which 
generally agreed with Paschen’s measurements. A notable exception, as 
pointed out by Goudsmit and Uhlenbeck 25 and also by Lande, 26 was the 
component 4j — , that is, the transition from the energy level n = 4, 

k = 1 to the level n = 3, k = 1 in the fine structure of helium and, as 
Hansen 27 observed, also in that of hydrogen. The selection rule Ak = ± 1 
forbids the appearance of this component and yet according to Paschen’s 
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measurements, the line had a relatively high intensity. Any explanation of 
its appearance on the basis of the Stark effect, which, as is known, invali¬ 
dates the selection rule, was shown to be untenable. 

In an attempt to render Sommerfeld’s approach more rigorous from 
the relativistic point of view, Darwin 28 replaced the ordinary Coulomb 
potential in Sommerfeld’s expression for the total energy by a retarded 
potential. Wilson’s suggestion 29 to reformulate the quantum conditions in 
accordance with the general theory of relativity by postulating that f (P s 
+ eA s ) dq s = n s h for s = 1,2, 3, and 4, where A s is the vector potential, 
was shown by Richardson 30 to lead to inconsistencies after a similar at¬ 
tempt by Kar 31 had been shown by Smekal 32 to be untenable. Attempts at 
reconciling the quantum conditions with the general theory of relativity 
were made by Schrodinger 33 and, in particular, by Wereide, 34 who, en¬ 
couraged by Bohr and Kramers, systematically studied the possibility of 
applying relativity to the Rutherford-Bohr model of the atom. All these 
relativistic considerations, however, were soon invalidated by the subse¬ 
quent discovery of the spin, and thus had little impact on the later develop¬ 
ment of the theory. 

The generalized quantum conditions (3.2) were for Sommerfeld the 
ultimate foundations of quantum theory or, as he expressed it, statements 
“unproved and perhaps incapable of being proved.’’ 35 Strictly speaking, 
Sommerfeld was right as far as contemporary physics was concerned. For 
only in 1926, after the advent of wave mechanics, could Sommerfeld’s 
conditions be rigorously deduced as approximations with the help of the 
Wentzel-Kramers-Brillouin (WKB) method. 

Sommerfeld’s conditions were not so arbitrary as they seemed to be at 
first. In fact, their deeper physical significance was soon revealed with the 
establishment of the “adiabatic principle,” which, as stated previously, 
was one of the conceptual foundations of the older quantum theory. The 
origin of this principle and its role in the development of the older quan¬ 
tum theory will now be discussed. 

The concept of “adiabatic changes” in purely mechanical systems ori¬ 
ginated in the early attempts 36 of Boltzmann 37 and Clausius 38 to reduce 
the second law of thermodynamics to pure mechanics. Boltzmann 39 
showed that, provided the laws governing the behavior of a mechanical 
system are subject to the principle of least action, the kinetic energy (in 
analogy to the addition of heat dQ ) supplied to a simply periodic system of 
period r is given by the expression 2 d(rE kin )/r, where E kin = (1/r )So 
E kin dt is the average kinetic energy of the system. Hence, if no external 
energy is supplied (corresponding to the adiabatic condition dQ = 0), the 
ratio E kin /v is an invariant. At the same time, Clausius emphasized in this 
context the importance of the study of rheonomous systems acted upon by 
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external forces whose law of action varies with time and whose potentials 
contain parameters which vary slowly compared with the natural periods 
of the system. In such processes the motions determining coordinates of 
the system were not directly affected by the external forces, just as in an 
adiabatic change of state in thermodynamics the coordinates which deter¬ 
mine the heat motion are not affected by external agencies. The introduc¬ 
tion of these ideas into the theory of so-called cyclic (in the topological 
sense of the word!) systems as studied primarily at the end of the century 
by Helmholtz 40 and Hertz 41 led to the definition of an “adiabatic motion” 
as the motion of a system which is acted upon by external forces in such a 
manner that these forces do not affect directly the coordinates of the 
system while certain parameters undergo slow variations. 42 

In 1902 Lord Rayleigh pointed out that in certain sinusoidally oscillat¬ 
ing systems—such as a simple pendulum with its string being slowly 
shortened or a transversally vibrating string being slowly covered by a 
narrow ring or the standing waves in a slowly contracting cavity—such 
adiabatic motions take place and that during the process the ratio between 
the energy and the frequency remains unchanged. 43 At the Solvay Con¬ 
gress in 1911 Lorentz raised the question whether a quantized pendulum 
whose string is being shortened remained in its quantized state. Einstein 
answered unhesitatingly: “If the length of the pendulum is changed infi¬ 
nitely slowly, its energy remains equal to hv if it was originally Av.” 44 

At the same time Paul Ehrenfest already recognized the fundamental 
importance of the concept of adiabatic invariance for quantum theory. In 
his study of the statistical foundations of Planck’s radiation law, a study 
which he began as early as 1906 if not earlier, 45 Ehrenfest was puzzled by 
the apparent paradox that Wien’s displacement law “which is wholly 
derived from classical foundations” remained “unshaken in the midst of 
the world of radiant phenomena whose anticlassical character stood out 
ever more inexorably.” 46 Wien’s law, it will be recalled, asserted 47 the 
proportionality between the energy density u v (v,T) and v*F(v/T) or, 
since u v referred to 8 ttvVc 3 proper vibrations within the frequency inter¬ 
val dv at v, the equality between the energy e v of each proper vibration of 
frequency v and v<P(v/T), where <&(v/T) differed from F(v/T) merely 
by a constant factor. Writing this relation as e v /v = 4>( v/T), Ehrenfest 
understood that Wien’s displacement law established a relation between 
two adiabatic invariants, 48 e v / v and v/T. But Ehrenfest now also recog¬ 
nized the reason why Wien’s original derivation of his displacement law, 
based on an adiabatical and infinitely slow compression of the radiation 
cavity, remained valid in quantum theory. For, e v /v being an adiabatic 
invariant, Planck’s quantum condition e v = nhv for each proper oscilla¬ 
tion, if satisfied initially, will be satisfied at every moment during the 
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process of compression. Hence the validity of Wien’s law in quantum 
theory was assured by the fact that the ratio e v /v for the sinusoidal vibra¬ 
tions before the adiabatic change equals the ratio e' /v' for the modified 
sinusoidal vibrations after the adiabatic change. Referring to the former 
state of vibrations as the “undeformed” state or motion and to the latter as 
the “deformed” state or motion, Ehrenfest pointed out that Wien’s law 
implied that in the course of an adiabatic transformation an allowed (or 
stationary) undeformed motion changes into an allowed deformed mo¬ 
tion while the adiabatic invariant retains its initial value. The assumption 
that this conclusion applies quite generally, and not only for sinusoidal 
motion, forms the content of what Ehrenfest, following a suggestion of 
Einstein, 49 called the “adiabatic principle.” The first full exposition of this 
principle, with reference to Boltzmann’s “mechanical theorem” men¬ 
tioned previously, appeared in a paper 50 which Ehrenfest published in 
1913, shortly after his appointment as Lorentz’s successor in Leyden. 
Three years later, in a comprehensive essay 51 on this subject, Ehrenfest 
formulated the principle as follows. Let the coordinates of the system, 
described by q x contain certain “parameters” a l , a 2 ,...,a n , the val¬ 
ues of which can be altered slowly, and let the kinetic energy of the system 
be a homogeneous quadratic function of the velocities the 

coefficients of which are functions of the q x f - -,q n and possibly also of the 
a x . If an infinitely slow change of the parameters transforms motion 
0(a) into another motion/?(a'),/?( a), and/?(a') are called “adiabatically 
related to each other” and the change affected “a reversible adiabatic 
affection.” If it is assumed that /3(a 0 ), corresponding to the particular 
values a l0 , a 20f ... f a n0 of the parameters, is an allowed motion, then the 
adiabatic principle asserts that for general values of a x , a 2 ... of the param¬ 
eters those and only those motions 0(a) are allowed which are adiabatic¬ 
ally related to 0(a o ), that is, those motions which can be transformed 
into, or may be derived from, allowed motions by a reversible adiabatic 
affection. 

For Ehrenfest, the main importance of the adiabatic principle—apart 
from the fact that it determined the formal applicability of classical me¬ 
chanics for quantum theory in the case of time-dependent parameters— 
was its use for the determination of the stationary states of deformed 
systems which are adiabatically related to undeformed systems with 
known quantum conditions. Thus, as Ehrenfest pointed out, the adiabatic 
principle made it possible to determine the allowed motions of any period¬ 
ic system of one degree of freedom if it is adiabatically related to the 
sinusoidal oscillator. For in order to find the quantum condition for the 
deformed system, it was only necessary to find some function J of the 
parameters and constants of motion which is invariant under the transfor¬ 
mation and which coincided with e v /v, the ratio between the energy and 
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the frequency of the oscillator at the beginning of the transformation. 
Once such an adiabatic invariant is found, the quantum condition for the 
deformed system reads J = nh, where n is a nonnegative integer. 

The existence of adiabatic invariants for systems of / degrees of free¬ 
dom was proved in statistical mechanics as early as 1910 by P. Hertz, 52 
who showed that the 2/-dimensional volume V= f • • • J dq x • • • dp n which 
in phase space is enclosed by the (2/— 1)-dimensional hypersurface 
e{q,p,a) = const is invariant under reversible adiabatic transformations. 
For/= 1, Ehrenfest pointed out, this result coincided with Boltzmann’s 
“mechanical theorem.” For in this case 

—~ ==2 \ 0 dt = ^^^~) di = ^v> dt = JJ dqdp= V 

Since for the harmonic oscillator 2 E kin = e v = nhv and the adiabatic in¬ 
variant J = e v /v = nh, the quantum condition for any periodic system of 
one degree of freedom which is adiabatically related to the oscillator is, 
according to Ehrenfest, </> pdq = nh. Although the adiabatic principle 
thus seemed at first to afford a systematic approach to the quantum- 
theoretic treatment of a great variety of dynamical systems, its practical 
application quite frequently met with insurmountable difficulties. For the 
method was of no avail whenever the adiabatic transformation had to 
involve a state in which a period of motion tends to infinity (as, for exam¬ 
ple, in the transformation from a pendulum to a rotator). For in such a 
case the condition of an infinitely slow variation of the parameters in 
comparison to the natural periods of the system could no longer be satis¬ 
fied. 

In 1916 Ehrenfest 53 established the conformance of Sommerfeld’s 
quantum conditions (3.3) for the hydrogen atom with the adiabatic prin¬ 
ciple by showing that the phase integrals in (3.3) are indeed adiabatically 
invariant. Integrating Lagrange’s azimuthal equation of motion, he ob¬ 
tained that p^, = mr 2 tfr is a constant which corresponds to the coordinate 
q 2 =$ with the period = rp/27T. The adiabatic invariant 2Elf^/v^ is 
therefore equal to 


P^ 

tfj/2 TT 




which is Sommerfeld’s first phase integral. Further, if^r,^ ,a 2 ,...) is the 
potential of the central force, depending on the parameters a x 
Lagrange’s radial equation mr — mry 1 + dx/dr = 0 or mr = 
Pv 2 /mr* — dx/dr may be interpreted as the equation of a linear motion, 
between the limits r x <r<r 2 , subject to a force with the potential 
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<£_= —p^/lmr 2 + %, and the corresponding adiabatic invariant is 
2E kfn/v r = § p r dr, which is Sommerfeld’s second phase integral. 

The adiabatic principle thus revealed the mystery of the quantum con¬ 
ditions. In fact, Bohr’s early quantum condition according to which “the 
angular momentum of the electron round the nucleus is equal to an entire 
multiple of h /2if 9 (see page 80), or iTrmvr = nh, now lost its air of magic. 
For lirmvr = 2-\mv 2 ' (2 ir/v) = 2E kin /vis—already according to Boltz¬ 
mann’s theorem—an adiabatic invariant. Sommerfeld’s result, at first un¬ 
expected, according to which the spectrum of the hydrogen atom remains 
unaltered whether the atom is treated as possessing only circular orbits or, 
more generally, as having elliptic orbits with a great variety of eccentric¬ 
ities, lost its moment of surprise. For it could be shown that the system 
with a circular orbit is adiabatically related (in the sense of Ehrenfest) to a 
system with an elliptic orbit of the same time period and hence E^ n must 
be the same for both systems. Now, according to the virial theorem of 
classical mechanics, for inverse-square-law forces, 2 E kin = — E^ so 
that both systems with the same period possess also the same total energy. 
Consequently, in consideration of Kepler’s third law, an allowed circular 
orbit in the hydrogen atom is energetically equivalent to an allowed ellip¬ 
tic orbit whose major axis equals the diameter of the former. But this was 
exactly Sommerfeld’s result. 

The adiabatic principle, as we see, made an important contribution to a 
profounder understanding of what had happened so far in the develop¬ 
ment of quantum theory. But certain questions on basic aspects still re¬ 
mained obscure. One of the problems so far unsolved was the choice of the 
proper coordinates. The Kepler problem, for example, could be treated in 
parabolic coordinates as well as in polar coordinates, but the quantum 
conditions and the quantized orbits that resulted from these two treat¬ 
ments were different for different systems of coordinates. 

The question of this ambiguity was successfully solved and a criterion 
for the correct selection of coordinates was established by Paul Sophus 
Epstein, a former student of Sommerfeld, and by Karl Schwarzschild, the 
director of the Astrophysical Observatory at Potsdam. Although intend¬ 
ed to solve a specific question in quantum theory, their work was essential¬ 
ly the conclusion of a problem in theoretical mechanics which had a long 
history of its own. When Jacobi solved the “one-body problem,” that is, 
the determination of the motion of a mass particle in the gravitational field 
of a stationary central body, he showed 54 that this Kepler problem can be 
solved by separating the Hamilton, later called the “Hamilton-Jacobi dif¬ 
ferential,” equation in two different sets of coordinates. Since then ques¬ 
tions concerning the existence and uniqueness of separable variables and 
the characteristics of motions associated with them had engaged the at¬ 
tention of theoretical physicists and mathematicians. 
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In 1887 Staude, 55 for systems with two degrees of freedom, and later 
Stackel, 56 for systems with an arbitrary finite number of degrees of free¬ 
dom, showed that the motion of a system whose Hamilton-Jacobi differ¬ 
ential equation can be integrated by separation of variables is “multiply 
periodic” or, in the terminology of astronomy, “conditionally periodic.” 
In other words, the generalized coordinates q k can be represented for a 
system with / degrees of freedom as /-fold infinite Fourier series of the 
form 


q k = £ C t%t, exp + ■■■ +r f w f )] (3.5) 

T |,...,Ty= — OO 

where w k —v k t and where the summation extends over all integral values 
of the r k . The motion, in general, is not periodic, its orbit having the 
characteristics of an “open” Lissajous figure. Only if the v k are commen¬ 
surable does the current point of the orbit ever return strictly to its starting 
point. Stackel also showed that for systems with a separable Hamilton- 
Jacobi differential equation the integral of action 2 J E kin dt can be sepa¬ 
rated into a sum ^ f [F k (q k ) ] 1/2 dq k . Each of the/ terms in this sum 

depends only on one of the generalized coordinates, and these oscillate 
between two fixed limits ( libration ) or are such that their increase by a 
certain constant value leaves the configuration of the system unchanged 
( rotation ). 

Meanwhile action-angle variables were introduced and their use was 
recognized as a powerful technique for the determination of the frequen¬ 
cies of periodic motions. If q k9 p k are a pair of separation variables, the 
action variable was defined as J k = § p k dq k9 where the integration ex¬ 
tends over a complete libration or rotation of the coordinate q k , whatever 
the case may be. The generalized coordinate conjugate to J k was known as 
the “angle variable” w k and satisfied w k = dW/dJ k , while W'\$ the gener¬ 
ating function of the contact transformation. From the canonical equa¬ 
tion 


w 


k 


dH(J l9 ... 9 J f ) 
dj k 


where the v k are constant functions of the action variables, and from the 
fact that the w k change by unity when the q k go through a complete period 
[since A w k = $ ( dw k /dqj ) dqj = $ /dqj dJ k ) dqj = (d/dJ k ) </> Pj 
dqj = dJj /dJ k = S jk ] it follows, if T k is the period associated with q k9 
that A w k = 1 = v k T k9 so that v k is the frequency associated with the 
periodic motion of q k . Thus, as we see, the method of action-angle vari- 
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ables provided the frequencies of periodic motions without affording their 
complete solution. It was therefore quite natural that this technique devel¬ 
oped originally in astronomy and was of little importance in general theo¬ 
retical mechanics. In fact, Delauney, who introduced this method in his 
Theorie du Mouvement de la Lune, 51 in which he accounted for most of 
the lunar accelerations which Laplace’s theory, as corrected by J. C. Ad¬ 
ams. left unexplained, made no mention of it in his Traite de Mecanique 
Rationelle . 58 Nor did other texts on theoretical mechanics, prior to the 
work of Schwarzschild and Epstein, give any detailed discussion on this 
subject. In astronomy, however, the importance of action-angle variables 
was meanwhile fully recognized. Poincare, in his Leqons de Mecanique 
Celeste, 59 and Charlier, in his classic textbook on celestial mechanics Die 
Mechanik des Himmels, 60 made systematic use of the method. 

It was Schwarzschild’s great merit 61 to have introduced into the quan¬ 
tum theory the analytical method of employing action-angle variables or 
“uniformizing variables,” as Bohr later used to call them. Although ori¬ 
ginally intended as theoretical accounts of the Stark effect on the basis of 
Sommerfeld’s generalization of Bohr’s theory, Schwarzschild’s and Ep¬ 
stein’s 62 papers not only clarified the problem concerning the choice of the 
coordinates to be used in expressing the quantum conditions but showed 
quite generally how quantum-theoretic problems concerning periodic or 
conditionally periodic systems have to be linked up with the Hamilton- 
Jacobi theory in classical mechanics. In fact, it was due to their work that 
in its subsequent elaborations by Bohr 63 and Bom 64 it almost seemed as if 
Hamilton’s method had expressly been created for treating quantum me¬ 
chanical problems. According to Epstein the general quantum conditions 
(3.2) should be formulated with reference to those coordinate systems in 
which the Hamilton-Jacobi differential equation allows itself to be sepa¬ 
rated. Epstein justified this rule by geometrical reasons based on certain 
properties of the envelopes of the orbit in configuration space. 

According to Schwarzschild, who like Epstein confined his treatment 
to conditionally periodic systems, the general quantum conditions should 
be formulated in terms of action-angle variables. Showing that this is 
always possible for separable systems, he paid special attention to those 
cases in which the Hamilton-Jacobi differential equation can be separated 
in various sets of coordinates. From the necessary and sufficient condi¬ 
tions of separability, found in 1904 by Levi-Civita, 65 it followed immedi¬ 
ately that with a given set of conjugate variables q kf p k satisfying the 
conditions also all those canonical variables are separable which can be 
obtained from the former by point transformations. A dynamical system 
which in addition to this class of variables is separable in still another set 
was called by Schwarzschild a “degenerate” system. Schwarzschild 
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showed that nondegeneration occurs when all the frequencies v* in (3.5) 
are incommensuable with one another. If, however, these frequencies sat¬ 
isfy n linear equations with integral coefficients, the system has an w-fold 
degeneration. In this case the orbit does not, as in the nondegenerate case, 
fulfill in the configuration space an /-fold, but only an (f— n ) -fold contin¬ 
uum and the system can be transformed by an appropriate change of 
variables into a svstem oif—n degrees of freedom. For degenerate sys¬ 
tems, according to Schwarzschild, the number of quantum conditions J k 
= </> p k dq k = n k h should be equal to the degree of periodicity, the num¬ 
ber of incommensurable frequencies in (3.5). Planck’s quantum condi¬ 
tions, 66 as Epstein 67 showed, embrace those of Schwarzschild. A math¬ 
ematically rigorous investigation of the equivalence between Planck’s, 
Schwarzschild’s, and Epstein’s quantum conditions was carried out by 
Kneser. 68 

Burgers, 69 a collaborator of Ehrenfest in Leyden, and Bom 70 studied 
the conditions which have to be imposed upon (not necessarily separable) 
dynamical systems with time-independent Hamiltonians in order to as¬ 
sure the possibility of introducing action-angle variables appropriate for 
the formulation of quantum conditions. They had to assume (1) that— 
just as in separable systems—a canonical transformation exists which 
transforms the q k ,p h into new variables w k9 J k by means of a generating 
function S(q x 9 J X 9 ... 9 q f9 J f ) such that the configuration of the system is a 
periodic function in the angle variables w k with the primitive period 1; (2) 
that the Hamiltonian transforms thereby into a function W which de¬ 
pends only on the action variables J k \ and (3) that the function 

S*=S-Yw k J k 


for which 


Pk = 


dS*(q l9 w l9 ...) 

dq k 



dS* 

dw k 




is periodic in the w k with period 1. It could then be shown that the/* (in 
the degenerate case their number is the degree of periodicity) are uniquely 
determined up to a homogeneous linear transformation with integral coef¬ 
ficients and a determinant ± 1. 

In addition Burgers 71 could show that the action variables are adiabatic 
invariants. Since the generating function has now to be considered as 
time-dependent, the transformed Hamiltonian H satisfies 
H = H + dS */dt 9 where H regarded as a function of the action-angle vari¬ 
ables is independent of the w k . The transformed canonical equation for 
the action variable reads 
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dS* \ 
dt ) 

or since S *, like S, depends on t only through the parameter a , 



In view of the slow variation of a Burgers deduced that 



and, utilizing a method introduced by Delauney and elaborated by Poin¬ 
care, 72 showed that the left-hand side of the last equation is of the order of 
magnitude a(t 2 — t l ), so that for infinitely slow variations of the param¬ 
eters a(a-+ 0), provided a ( t 2 — t x ) remains finite, A J k = 0. Certain diffi¬ 
culties connected with the violation of the incommensurability conditions 
were later removed by von Laue, 73 and finally Levi-Civita 74 studied rigor¬ 
ously the mathematical foundations of the theory of adiabatic invariants. 

The adiabatic principle was not only a constructive method for the 
establishment of quantum conditions and the determination of allowed 
motions. It also served as a criterion for the correctness of a given quanti¬ 
zation. Planck’s treatment 75 of the asymmetric top, whose moments of 
inertia were subjected to an adiabatic change during which the boundaries 
of the elementary regions in phase space remained unaltered, is a classic 
example of this use of the principle. Finally, the adiabatic principle could 
also be applied directly to the treatment of atomic systems which were 
exposed to the influence of external forces, provided the degree of period¬ 
icity was not changed by the presence of these forces. One of the most 
brilliant applications of the principle, in this sense, and, at the same time, 
one of the most striking achievements of the older quantum theory was the 
explanation of the Stark effect by Schwarzschild and Epstein. Let us then 
briefly recall the history of this effect. 

According to the principles of classical physics a homogeneous exter¬ 
nal electric field should not affect the type of motion of an electron in the 
atom. A constant field superimposed upon a quasi-elastically bound elec¬ 
tron will merely shift the center of the orbit by an amount proportional to 
the intensity of the field, as the equation of motion immediately shows. 
Yet in spite of these arguments and in spite of the absence of any observa¬ 
tional evidence at that time, Voigt 76 predicted in 1901, a few years after 
Zeeman’s discovery of the splitting of spectral lines in magnetic fields, an 
analogous effect for electrostatic fields. Voigt also construed a theory to 
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account for such an effect but showed that even for rather strong field 
intensities the splitting would be too small to be experimentally observ¬ 
able. 

When Stark, 77 however, about ten years later, studied atomic ions and 
the spectra of their canal rays and came to the conclusion that the change 
of the electric state of an atom due to ionization entails a change of its 
“optical frequencies,” he decided to investigate the problem experimen¬ 
tally, in defiance of Voigt’s discouraging remarks. Since Geissler tubes, 
the conventional sources for hydrogen lines, proved incapable of sustain¬ 
ing powerful electrostatic fields, Stark applied such fields to canal rays 
and investigated the luminescence of a canal-ray tube in a layer directly 
behind the perforated cathode. Thus, in his laboratory at the Technische 
Hochschule in Aachen, Stark 78 discovered in 1913 that an electric field 
splits the lines of the Balmer series into a number of components. This 
phenomenon, which, as Stark already recognized, was by no means pecu¬ 
liar to hydrogen, was subsequently called the “Stark effect” or sometimes 
also the “Stark-LoSurdo effect.” 

For at the same time—or possibly even earlier 79 —LoSurdo, 80 studying 
the Doppler effect and examining the radiation emitted by the gas mole¬ 
cules in the Crookes dark space between the cathode and the negative 
glow in a discharge tube, found, like Stark, that the hydrogen lines split 
into several components. Although LoSurdo’s experimental setup was 
much simpler than Stark’s, he could obtain only qualitative results, hav¬ 
ing no direct method at his disposal for a precise measurement of the steep 
potential gradient as a function of position inside the tube. 

Stark’s discovery that each Balmer line is split up into components 
whose number increases with the series number of the line, that the resolu¬ 
tion is proportional to the strength of the electric field, and that the com¬ 
ponents are linearly polarized when viewed transversely to the direction 
of the field soon became the subject of intensive theoretical research. The 
first attempt to account for these results was made by Warburg, 81 who 
succeeded in interpreting the general features of the effect on the basis of 
Bohr’s theory. Bohr’s own early work 82 on the Stark effect resulted in a 
good qualitative agreement with experience, but the finer details could not 
be explained. 

In his first paper on the subject, published in 1914, Schwarzschild 83 
discussed the Stark effect along classical lines and showed that the math¬ 
ematical treatment of the effect can be regarded as a limiting case of Jaco¬ 
bi’s 84 solution of the motion under the influence of two fixed centers of 
Newtonian (or Coulombian) attraction, a case which according to Jacobi 
is separable in elliptical coordinates (i.e., in the parameters of families of 
confocal ellipses and hyperbolas whose foci are at the fixed centers of 
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force). Schwarzschild now pointed out that the situation arrived at if one 
center of force is removed to infinity while its attractive power is corre¬ 
spondingly increased is that of the Stark effect. The families of ellipses and 
hyperbolas become in the limit confocal parabolas whose focus lies at the 
remaining fixed center which coincides with the nucleus. The Hamilton- 
Jacobi differential equation for the Stark effect, Schwarzschild concluded, 
is therefore separable in parabolic coordinates. 

Thus, using parabolic coordinates £, 1 7 and the azimuthal angle tp with 
respect to the direction of the field, Schwarzschild 85 and Epstein , 86 in their 
classic papers of 1916, knew from the beginning that the problem under 
discussion is that of a conditionally periodic motion and that each of the 
corresponding momenta p^p^p^, is the square root of a simple rational 
function of the corresponding position coordinate. Owing to the external 
field, the motion is nondegenerate and subject to the three quantum condi¬ 
tions 




= n 


3 


h 


where the parabolic quantum numbers n x and n 2 are nonnegative integers 
and the equatorial quantum number n 3 is a positive integer (zero has to be 
excluded since it would correspond to a collision between the electron and 

the nucleus). = y[J\J£) and p v = ^f 2 (y) each contain the energy 
constant W , an integration constant/?, and the constant/?^ which by the 
third quantum condition is proportional to n 3 . Solving for W y which con¬ 
sequently depends on all three quantum numbers, Schwarzschild and Ep¬ 
stein obtained for the energy, approximated to the first power of the field 
intensity F, the expression 


- W = 


liP'mZ 2 e 4 



1 

(n l + n 2 + n 3 ) 2 


+ 


3 h 2 F 
87 ^mZe 


(n 2 — n l )(n l + n 2 + n 3 ) 


where Ze is the charge of the nucleus. The agreement between the wave¬ 
lengths of the components of the Balmer lines as calculated on the basis of 
this formula and their observed positions was excellent. In fact, Epstein 
could declare : 87 “We believe that the reported results prove the correct¬ 
ness of Bohr’s atomic model with such striking evidence that even our 
conservative colleagues cannot deny its cogency. It seems that the poten¬ 
tialities of quantum theory as applied to this model are almost miraculous 
and far from being exhausted.” 
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The older quantum theory, as we understand today, could apparently 
achieve this brilliant success because the interaction between the spin of 
the electron and the applied field is negligibly small. 88 It should also be 
recalled that the first-order Stark effect is peculiar to hydrogen. For it is 
only in hydrogen that the accidental (Coulomb field) degeneracy is re¬ 
moved by the external field; in all other atoms it is already removed by the 
internal fields owing to the other electrons in the atom. 


3.2 The Correspondence Principle 

The formal applicability of classical mechanics to quantum theory as far 
as a given stationary state is concerned was, as we have seen, the subject of 
the adiabatic principle. The formal applicability of classical mechanics to 
quantum theory as far as transitions between such states are concerned is 
the subject of the correspondence principle, the second fundamental prin¬ 
ciple of the older quantum theory. Relating the kinematics of the electron 
to the properties of the emitted radiation, the correspondence principle 
became a conceptual instrument indispensable for the quantum-theoretic 
treatment of problems concerning the finer details of the emitted radi¬ 
ation, such as the intensity of spectral lines or their polarization. Even for 
problems which could not be treated by the theory of periodic or condi¬ 
tionally periodic systems this principle produced results in fair agreement 
with experience. 

The conceptual foundation of the correspondence principle was based 
on the assumption that quantum theory or at least its formalism contains 
classical mechanics as a limiting case. This idea was expressed by Planck 89 
as early as 1906, when he showed that in the limit h -► 0 quantum-theoretic 
conclusions converge toward classical results, or, as he said: “The classi¬ 
cal theory can simply be characterized by the fact that the quantum of 
action becomes infinitesimally small.” 90 In fact, it was easy to see that for 
h tending to zero Planck’s radiation formula goes over into the classic 
Rayleigh-Jeans formula. 

The same result, however, is also obtained if for constant h the frequen¬ 
cy v approaches zero. This was the idea chosen by Bohr in his establish¬ 
ment of the correspondence principle. For, according to his frequency 
law, the convergence of v toward zero implies that the energy differences 
become arbitrarily small or, in other words, that in the limit the energies 
form almost a continuum. Since for periodic or conditionally periodic 
systems the energy can be expressed as a function of the phase integrals 
and these, in turn, as functions of the quantum numbers n, 9X in virtue of 
the quantum conditions, quantum theory approaches classical physics 
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whenever the differences A n in the quantum numbers are small compared 
with their absolute values n. 

As pointed out on page 78, Bohr applied this reasoning, though only as 
a procedure ad hoc in his treatment of the hydrogen spectrum, as early as 
1913 when he identified the classical frequency of revolution of the elec¬ 
tron with the radiation frequency of the emitted spectral line for transi¬ 
tions between the (n + 1 )st and the nth stationary state for very large n. 
In his address before the Physical Society in Copenhagen, Bohr 92 declared 
explicitly in this context that his recourse to classical physics should not 
be interpreted as a quest for causal explanation but merely as a heuristic 
device. “You understand, of course,” he said, “that I am by no means 
trying to give what might ordinarily be described as an explanation; noth¬ 
ing has been said here about how and why the radiation is emitted. On one 
point, however, we may expect a connection with the ordinary concep¬ 
tions, namely, that it will be possible to calculate the emission of slow 
electromagnetic oscillations on the basis of classical electrodynamics.” 93 

In his important memoir of 1918 “On the quantum theory of linespec- 
tra” in which Bohr discussed the application of the quantum theory to the 
determination of the line spectrum of a given atomic system, he remarked 
that even without introducing detailed assumptions as to the mechanism 
of transition between stationary states, one can show that the frequencies 
calculated by the frequency law “in the limit where the motions in succes¬ 
sive stationary states comparatively differ very little from each other, will 
tend to coincide with the frequencies to be expected on the ordinary theo¬ 
ry of radiation from the motion of the system in the stationary states.” 94 
Bohr added that these considerations would be shown to throw light on 
the question of the polarization and intensity of the different lines of the 
spectrum of a given system. This was Bohr’s first explicit formulation of 
the correspondence principle. In the memoir under discussion he did not 
yet call it by this name but referred to it as “a formal analogy between the 
quantum theory and the classical theory.” 

In Section Three of Part One of his memoir Bohr justifies his conten¬ 
tion for conditionally periodic systems as follows. 95 According to the clas¬ 
sical theory of such systems v k = dE /dJ k , where E = 7/is the total energy 
of the system—the notation of page 104 is used—whereas according to the 
frequency formula hv qu = A E. Let A J k = ( n k — n k )h = r k h , where n k 
and n k are the quantum numbers characterizing the stationary states be¬ 
fore and after the transition. On the assumption of large quantum 
numbers A E = ?,{dE/dJ k ) A J k = 'lr k v k h . Hence v qu = = v cl , 

where v cl is the classical frequency (combination of higher harmonics) 
appearing in one of the terms of the multiple Fourier expansion (3.5) on 
page 104. In the limit, therefore, the quantum-theoretic frequency v qu 
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coincides with the classical mechanical frequency v cl . By demanding that 
this correspondence remain approximately valid also for moderate and 
small quantum numbers, Bohr generalized and modified into a principle 
what in the limit may be regarded formally as a theorem. 

The expression “correspondence” as a terminus technicus in this sense 
appeared for the first time in Bohr’s paper “On the series spectra of the 
elements” 96 in the following statement: “Moreover, although the process 
of radiation cannot be described on the basis of the ordinary theory of 
electrodynamics, according to which the nature of the radiation emitted 
by an atom is directly related to the harmonic components occurring in 
the motion of the system, there is found, nevertheless, to exist a far-reach¬ 
ing correspondence between the various types of possible transitions be¬ 
tween the stationary states on the one hand and the various harmonic 
components of the motion on the other hand. This correspondence is of 
such a nature that the present theory of spectra is in a certain sense to be 
regarded as a rational generalization of the ordinary theory of radi¬ 
ation.” 97 Having meanwhile recognized the far-reaching methodological 
importance of this idea for the further development of quantum theory, 
Bohr named it “the correspondence principle” in the “Appendix to Part 
Three” of his Academy memoir. 98 Without the principle, Bohr realized, 
the quantum theory in contrast to the atomic theory of classical physics 
was an incomplete theory. Newton’s mechanics and Maxwell’s electrody¬ 
namics allowed calculating not only the frequencies but also the intensi¬ 
ties and states of polarization of spectral lines. For according to classical 
physics the absolute value of the square of the (vectorial) Fourier coeffi¬ 
cient D t in the multiple expansion representing the (vectorial) dipole 
moment served as a measure of the intensity of the corresponding partial 
vibration, characterized by the integers t,..., r f in (3.5)." The ratio of the 
x y y y and z components of this (vectorial) Fourier coefficient specified the 
polarization. Classical physics was therefore a complete theory of atomic 
spectra affording full information on the nature of the emitted radiation— 
though at variance with experience. Bohr’s theory, however, which ex¬ 
pressly disclaimed any knowledge concerning the mechanism of transi¬ 
tions between stationary states, renounced thereby from the beginning 
any possibility of a rational foundation for deriving the intensities and 
polarizations of spectral lines. The only way out of this impasse, as Bohr 
recognized, was via the correspondence principle. 

But instead of establishing a direct correspondence between the inten¬ 
sity of a spectral line and the respective Fourier coefficient in the classical 
representation of the dipole moment, Bohr incorporated into his theory 
an additional idea recently conceived by Einstein. For in 1916 Einstein 
derived Planck’s radiation law on the assumption that the transitions are 
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governed by certain probability laws similar to those postulated in the 
theory of radioactivity. 100 Denoting the probability for the occurrence of 
an externally not excited transition from the higher energy level E m to the 
lower E n , during the time interval dt, by A n m dt and the probability for the 
occurrence of an induced or stimulated transition between the same ener¬ 
gy levels in the radiation field by B n m p dt, where p is the energy density 
(which on page 4 was denoted by u y ), and finally the probability of the 
occurrence of the inverse transition (absorption) by B™p dt , Einstein 
deduced from the preservation of the canonical distribution of states un¬ 
der equilibrium conditions that 

= + (3.6) 


Assuming that p increases to infinity with T, Einstein obtained 
B™ = B n my the first, though implicit, application of what later was called 
“the principle of detailed balancing/’ Comparing the solution for p y 
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with the Wien radiation law in the limit of high frequencies and with the 
Rayleigh-Jeans formula in the limit of small frequencies, Einstein con¬ 
cluded that 


E m -E n = hv 


which is Bohr’s frequency condition, and that 
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(3.7) 


and thus obtained Planck’s radiation law, Eq. (1.16) on page 16. It should 
be noted in the present context that the very existence of the induced 
emission was suggested to Einstein—one is almost tempted to say, in 
accordance with the correspondence principle—by the classical theory 
concerning the interaction between the radiation field and a system of 
oscillating charges. 101 Had Einstein omitted the induced radiation, he 
would have obtained Wien’s, and not Planck’s, radiation law. 

In presenting these probabilistic considerations which, as Einstein 
phrased it, lead “in an amazingly simple and general way” 102 to Planck’s 
law, Einstein—contrary to Planck 103 —did not explicitly define his con¬ 
ception of probability. In view of this fact it has often been claimed 104 that, 
ironically as it sounds, Einstein, the declared protagonist of causality in 
physics and the philosopher who rejected the “dice-playing god” 105 was 
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the first to introduce the modem quantum-theoretic notion of probability, 
not as an expression of insufficient knowledge of fine-scale parameters, as 
conceived in the kinetic theory, but as an affirmation in a fundamental 
manner of the contingency of a single event. A closer analysis of Einstein’s 
paper, however, will show that this imputation is not justified. First of all, 
Einstein’s comparison of his statistical law with that of radioactive disin¬ 
tegration 106 cannot be used as an argument since at that time radioactive 
disintegration was generally considered, as, for example, by Planck, as a 
process involving as yet unknown parameters. Second, the absence of a 
declaration in favor of causality cannot be interpreted as a declaration in 
favor of the absence of causality. Third, referring to the related problem of 
the reaction of an atom or molecule during emission, Einstein stated ex¬ 
pressly that “in the present state of the theory the direction of recoil is only 
‘statistically’ determined,” 107 alluding thereby to the merely preliminary 
character of such an approach. 

Bohr, however, in agreement with his own basic ideas, saw in Einstein’s 
approach a renunciation of any inquiry into the causal structure of transi¬ 
tions. Einstein’s statement that a molecule may pass from one state to an 
energetically lower state “without excitation by an external cause” 108 —a 
statement which, in principle at least, leaves room for the activity of “in¬ 
ner” parameters—was interpreted by Bohr as meaning that the system 
“will start spontaneously to pass to the stationary state of smaller ener¬ 
gy,” 109 where “spontaneously” is taken as essentially synonymous with 
“acausal.” In fact, following Bohr, Einstein’s A n m is often referred to as 
the “probability factor of spontaneous emission” although Einstein never 
used this term in his paper. 

Extending the domain of the applicability of the correspondence prin¬ 
ciple, Bohr associated Einstein’s a priori probabilities with the above- 
mentioned Fourier coefficients in the multiple Fourier expansion of the 
dipole moment, the correspondence being 
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One of the important results which Bohr could derive directly from the 
correspondence principle referred to conditionally periodic systems of/ 
degrees of freedom possessing an axis of symmetry and subject to a central 
force. Calling the separable variables q l9 q 2 q / and taking q x =<p as the 
cyclic variable, Bohr showed that a straightforward application of the 
Hamilton-Jacobi theory 110 led for k = 1,...,/ to the equations x k + iy k 
=fk(q 2>—9 Qf ) el<P 9 where the function f k does not contain the cyclic vari¬ 
able and where x k , y k are the cartesian coordinates of the k th point mass 
in the invariable plane drawn through the center of gravity. For the com¬ 
ponents of the dipole moment, therefore, 
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P x + iP y = £ ± e(x k +iy k )= F { (q 2 ,q f )e iv 


and 


P z = X ez k = F 2^2, -,?/)• 


Since for k = 2, 


or inversely 


fk (*?2> •••> Qfy *^l> •••> '/) 


Qk ^/» *A> •••> //■) 

where w k = v k t + <5* and w (p =vt + 8, the expansion of F x and F 2 into a 
Fourier series yields 


P x + iP y = exp (27rzV/) £ D £ T/ exp [2t77/(t 2 v 2 + * * • + r f v f ) ] 
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and 


P Z =^D ^. T/ exp [ 2 mt{T 2 v 2 + • • • + T f v f ) ] 

r 2 ...r f 

Since r x assumes in the Fourier expansion of P x -h iP y only the value 1 
and does not appear at all in the Fourier expansion of P z and recalling that 
Tj = n\ — n v Bohr derived from the correspondence principle that the 
azimuthal quantum number n x or k, as it was usually denoted, could 
decrease or increase only by one unit for circularly polarized radiation 
whereas it had to remain unchanged for radiation polarized parallel to the 
axis of the system. The correspondence principle, as this calculation 
showed, thus led to a theoretical explanation of certain restrictions in the 
use of Ritz’s combination principle, restrictions which subsequently were 
called “selection rules.” 

One of the examples to which Bohr immediately applied these consid¬ 
erations was Planck’s linear harmonic oscillator, 111 whose mechanical 
frequency of motion may be called v cl . As shown in ref. 13 on page 151, 
its quantum-theoretic energy levels are nhv cl and, according to Bohr’s 
frequency condition, its quantum-theoretic optical frequencies v qu are 
therefore (A«)v cl . Since the Fourier expansion of the linear harmonic 
motion reduces to only one term, namely, that with frequency 1 • v cl , the 
only surviving r is unity and Bohr’s correspondence principle requires 
that A n= ± 1 or v qu = v cl . Now, as a rule, the quantum-theoretic fre¬ 
quency v qu coincides with the classical frequency v cl only in the case of 
high energy levels. For Planck’s oscillator, however, as we see, the equa¬ 
tion v qu = v cl holds strictly for all energy levels, even those characterized 
by the smallest possible quantum numbers. It now became clear that this 
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exceptional property of the harmonic oscillator was a fortunate feature for 
the conceptual development of quantum theory. Otherwise, the theory 
would have had to revoke its own foundations, namely, the radiation law 
for the derivation of which Planck identified the mechanical frequency of 
the oscillator with its optical frequency. 

On the basis of the correspondence principle Kramers, 112 a pupil and 
later associate of Bohr, calculated in his dissertation on the “Intensities of 
spectral lines” the relative intensities of the components of the fine struc¬ 
ture and of the Stark effect with special consideration of the first four 
Balmer lines in the hydrogen spectrum and showed that the theoretical 
results agree surprisingly well with the observational data as known at the 
time. 113 Kramers summarized his conclusions as follows: "... the results 
obtained as regards the applications to the Stark effect and to the fine 
structure of the hydrogen lines must be considered as affording a general 
support of Bohr’s fundamental hypothesis of the connection between the 
intensity of spectral lines and the amplitudes of the harmonic vibrations 
into which the motion of the electron in the atom may be resolved, the 
more so because it seemed possible to obtain a natural understanding of 
certain marked deviations of the observed intensities from the preliminary 
theoretical estimate of the intensity distribution obtained on the basis of 
this hypothesis. It seems therefore justifiable to conclude that Bohr’s con¬ 
siderations offer a sound basis for a further development of the theory of 
intensities of spectral lines.” 114 

The correspondence principle turned out to be a most versatile and 
productive conceptual device for the further development of the older 
quantum theory—and, as we shall see in due course, even for the estab¬ 
lishment of modem quantum mechanics. For periodic and conditionally 
periodic systems it lent additional support to the formulation of the gen¬ 
eral quantum conditions, and this on the following grounds: if for high 
quantum numbers the quantum-theoretic frequency 

AJE 1 dE vA/ 

h h dJ h 

is identified with the classical mechanical frequency rv, then rh = A J 
and, since r is an integer, J = nh up to an additional constant. Even for 
systems which are neither periodic nor conditionally periodic a method of 
quantization was worked out by Smekal 115 on the basis of the principle, in 
contrast to the approach of Bom and Pauli, 116 who based their calcula¬ 
tions on the classical theory of perturbations as developed in astronomy. 
Indeed, in the hands of Bohr and his school the correspondence principle 
was like “a magic wand that allowed the results of the classical wave 
theory to be of use for the quantum theory.” 117 It was due to this principle 
that the older quantum theory became as complete as classical physics. 
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But a costly price had to be paid. For taking resort to classical physics in 
order to establish quantum-theoretic predictions, or in other words, con¬ 
structing a theory whose corroboration depends on premises which con¬ 
flict with the substance of the theory, is of course a serious inconsistency 
from the logical point of view. Being fully aware of this difficulty, Bohr 
attempted repeatedly to show that “the correspondence principle must be 
regarded purely as a law of the quantum theory, which can in no way 
diminish the contrast between the postulates and electrodynamic theo¬ 
ry.” 118 The earliest allusion to such a conception may perhaps be found as 
early as 1921 in a paper 119 in which Bohr briefly discussed the function of 
the principle. In his statement that the principle originated “in an effort to 
attain a simple asymptotic agreement between the spectrum and the mo¬ 
tion of an atomic system in the limiting region where the stationary states 
differ relatively only little from each other,” 120 he apparently tried to 
describe the principle without reference to conceptions extraneous to 
quantum theory. In accordance with his insistence on the irreconcilability 
of quantum theory with classical mechanics and electrodynamics, he re¬ 
garded the correspondence principle as merely affirming a formal analogy 
of heuristic value. Although his numerous and often somewhat conflict¬ 
ing statements, made from 1920 to 1961, on the essence of the correspon¬ 
dence principle 121 make it difficult, if not impossible, to ascribe to Bohr a 
clearcut unvarying conception of the principle, 122 it seems that at the time 
under discussion, that is, in the early twenties, he did not view the princi¬ 
ple as implying the inclusion of classical physics within quantum theory. 
Not only would such a conception have contradicted, of course, his funda¬ 
mental dialectics of the irreconcilability just mentioned, but it would also 
have implied that a quantum-theoretic theorem makes an assertion on 
classical physics. Bohr’s attitude, at least at that time, was well described 
by Kramers 123 when he wrote in 1923: “Bohr expressed himself in his 
talks somewhat as follows: classical as well as quantum theory are each as 
a description of nature merely a caricature; it allows, so to speak, in two 
extreme regions of phenomena an asymptotic presentation of physical 
reality.” 124 

Later, Bohr’s conception of complementarity, which gave a deeper 
significance to his previous ideas on the irreconcilable disparity between 
classical and quantum theory, precluded, now on epistemological 
grounds, the possibility to interpret the correspondence principle as as¬ 
serting the inclusion of classical mechanics within quantum mechanics. 
But at that time as we shall see, the correspondence principle had accom¬ 
plished its major function: it had contributed decisively to the establish¬ 
ment of modem quantum mechanics. 

In 1924 Bohr wrote: “As frequently emphasized, these principles, al¬ 
though they are formulated by the help of classical conceptions, are to be 
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regarded purely as laws of the quantum theory, which give us, not with¬ 
standing the formal nature of the quantum theory, a hope in the future of a 
consistent theory, which at the same time reproduces the characteristic 
features of the quantum theory, important for its applicability, and, never¬ 
theless, can be regarded as a rational generalization of classical electrody¬ 
namics.” 125 About three years later this consistent theory was accom¬ 
plished. In fact, there was rarely in the history of physics a comprehensive 
theory which owed so much to one principle as quantum mechanics owed 
to Bohr’s correspondence principle. 


3.3 The Zeeman Effect and Multiplet Structure 

It was, of course, by no means a pure coincidence in the development of 
the older quantum theory that in the same year in which Schwarzschild 
and Epstein explained the Stark effect Debye and Sommerfeld interpreted 
the normal Zeeman effect. The Zeeman effect not only “opened a new 
world of facts which interest the physicist, the chemist, and even the 
astronomer,” as Becquerel and Deslandres 126 realized in less than two 
years after its discovery, but its study also contributed—to an extent much 
greater than the study of the Stark effect—to the conceptual development 
of quantum theory; we shall therefore discuss the Zeeman effect and the 
problems it raised in greater detail. 

The first to anticipate a possible influence of magnetic forces on spec¬ 
tral lines was Michael Faraday. His philosophy of the essential unity of all 
natural forces and particularly his discovery of the magnetic rotation of 
the plane of polarization 127 in 1845 convinced him of the existence of an 
intimate relationship between optical and magnetic phenomena. It was 
only in 1862, however, five years before his death, that Faraday performed 
an experiment in order to confirm his anticipation: he investigated the 
spectra of sodium, barium, strontium, and lithium with Steinheil’s newly 
invented prismatic spectroscope and an electromagnet. In a letter dated 
March 12, 1862, Faraday described the outcome as follows: “The colour¬ 
less gas flame ascended between the poles of the magnet and the salts of 
sodium, lithium etc., were used to give colour. A Nicol’s polariser was 
placed just before the intense magnetic field and an analyser at the other 
extreme of the apparatus. Then the electro-magnet was made, but not the 
lightest trace of effect on or change in the lines in the spectrum was ob¬ 
served in any position of polariser or analyser.” 128 

In his sketch of Faraday’s life Maxwell mentioned that Faraday “made 
the relation between magnetism and light the subject of his very last ex¬ 
perimental work. He endeavoured, but in vain, to detect any change in the 



Older Quantum Theory 


119 


lines of the spectrum of a flame when the flame was acted on by a powerful 
magnet.” 129 


Tait was probably the first to resume, after Faraday, the study of the 
effect of magnetism on spectra when he discussed, 130 in 1875, the possibil¬ 
ity of an influence of magnetic forces on the absorption of light. Ten years 
later Fievez 131 reported a definite broadening (and partial reversal) of the 
D lines of sodium and of some other principal series lines in the spectra of 
potassium, lithium, and thallium if exposed to magnetic fields. However, 
as a critical examination of Fievez’s description of his experimental meth¬ 
od shows, it is almost certain that in his observations the true effect must 
have been masked by disturbing factors such as pressure broadening. Re¬ 
ferring to Fievez’s experiments, Lodge 132 pointed out:"... it appears likely 
that a variety of unimportant causes of disturbance must have been pres¬ 
ent, and that if the true effect was seen at all, it was so mixed up with 
spurious effects as to be unrecognizable in its simplicity, and so remained 
at that time essentially undiscovered.” It was Pieter Zeeman who resorted 
to a method of experimentation which eliminated such objections. 

In the course of his study of the Kerr effect it occurred to Zeeman in 
1893 to find out “whether the light of a flame if submitted to the action of 
magnetism would perhaps undergo any change.” 133 He therefore studied 
the spectrum of a sodium flame placed between the poles of a Ruhmkorff 
electromagnet—but again without positive results. Two years later, when 
reading Maxwell’s above-mentioned reference to Faraday’s experiment 
on this problem, Zeeman said to himself: “If a Faraday thought of the 
possibility of the above-mentioned relation, perhaps it might be yet worth¬ 
while to try the experiment again with the excellent auxiliaries of spectros¬ 
copy of the present time.” 134 Thus, in August, 1896, in his laboratory at 
Leiden University, he repeated the experiment with a powerful Ruhm¬ 
korff electromagnet, energized by a battery of accumulators (27 amp), 
with a Rowland grating of 10-foot radius and 14,983 lines per inch, and 
with a bunsen flame fed with small quantities of sodium from a piece of 
asbestos impregnated with common salt, and placed between the poles. 
He now found that whenever “the current was put on, the two D lines 
were distinctly widened.” 135 In a second series of experiments Zeeman 
enclosed heated sodium in a porcelain tube, closed at both ends by parallel 
glass plates and continuously rotated round its axis to avoid temperature 
variations. “Excitation of the magnet caused immediate widening of the 
lines,” which regained their original appearance as soon as the current 
was cut off. “It thus appears very probable that the period of sodium light 
is altered in the magnetic field,” concluded Zeeman’s report. 136 Thus ten 
years after Hertz demonstrated the fundamental role of electric and mag¬ 
netic forces for the propagation of light, Zeeman established their role for 
the production of light as well. 
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Lorentz, to whom Zeeman reported the outcome of his experiments 
before their publication, saw at once how to interpret these results on the 
basis of his electron theory. Resolving the motion of the quasi-elastically 
bound charged particle into a linear vibration in the direction of the field 
(and hence unaffected by the field) and into two superposed circular 
motions, executed in opposite directions, in the plane perpendicular to the 
direction of the field, Lorentz predicted that the light from the edges of the 
broadened line, if viewed longitudinally (i.e., in the direction of the lines 
of force), should be circularly polarized, and if viewed transversally (i.e., 
at right angles to the field), should be plane-polarized, a conclusion which 
Zeeman immediately confirmed with the help of a quarter-wavelength 
plate and an analyzer. Lorentz’s classic explanation of the Zeeman effect, 
based on the equation mv 2 /r= kr+ ( e/c ) vH, where H denotes the inten¬ 
sity of the magnetic field, k = and v = lirrv , and leading to 

v = v 0 + eH /4irmc for the frequencies of the two circular motions, was 
published 137 in 1897, at a time when Zeeman had already successfully 
resolved the “broadened line” A 4800 of cadmium into doublets for longi¬ 
tudinal observation and triplets for transversal observation, employing a 
field of about 32,000 gauss and a grating spectroscope. 138 

It soon became clear that Lorentz’s result was but an application of a 
general electrodynamic theorem which Larmor 139 had proved in 1897 and 
two years later employed for the explanation of the Zeeman effect. 140 
According to Larmor’s theorem the effect of a magnetic field on a group of 
rotating ions or electrons for all of which the ratio e/m of electric charge 
to mass has the same value is to superimpose on the orbital motion a 
precessional motion about the direction of the field with angular velocity 
co given by eH /2mc, or, as he expressed it in his Philosophical Magazine 
article, “the oscillation thus modified will be brought back to its original 
aspect if the observer is attached to a frame which revolves with angular 
velocity eH /2mc round the axis of the magnetic field.” 141 Larmor’s proof 
of his famous theorem was based on a straightforward derivation from the 
equation of motion (the term proportional to c~ 2 being neglected), its 
physical meaning being essentially the balancing of the Coriolis force by 
the Lorentz force. Thus, applying the theorem to the above-mentioned 
circular motions and keeping in mind that the third linear vibration was 
unaffected, he obtained immediately the splitting of the line into the Lor¬ 
entz triplet. It is interesting to note that Larmor knew before Zeeman 
published his experiments and before the electron was discovered that a 
magnetic effect would depend on the charge-to-mass ratio of the charged 
particles. Assuming that nothing smaller than an atom could radiate, 
Larmor obtained a very small separation (about 2000 times too small) 
and concluded—just as Voigt did with respect to the Stark effect—that 
the effect would be too small to be detectable. 
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As soon, however, as the electron was discovered and the charge-to- 
mass ratio of cathode rays and P rays established (J. J. Thomson, 1897), 
the agreement of this ratio with e/m as obtained from the analysis of the 
Zeeman effect with known magnetic intensity lent decisive support to the 
hypothesis that the electron was indeed a constituent part of the atom and 
responsible for the emission of light. Moreover, from the fact that the 
shorter wavelength component was circularly polarized in the same sense 
as the current in the electromagnet, Konig 142 and Cornu 143 inferred the 
sign of the charge of the electron (“resinous”). It was also rather fortu¬ 
nate for the development of the theory that Zeeman had been unable to 
resolve the real pattern of the broadened sodium lines but did so with some 
lines in the spectra of cadmium and zinc, lines which belong to the singlet 
systems. Thus, in the fall of 1897 the agreement between experience and 
theory was perfect. 

A few weeks later, however, on December 22, 1897, Preston reported 
to the Royal Dublin Society on his experiments on the Zeeman effect, 
performed with Barrett’s powerful electromagnet of the Royal College of 
Science in Dublin and a concave grating with a radius of 21.5 feet and 
14,438 lines per inch, and declared: “It is interesting to notice that the two 
lines of sodium and the blue line 4800 of cadmium do not belong to the 
class which show as triplets. In fact, the blue cadmium line belongs to the 
weak-middled quartet class, while one of the D lines (D 2 ) shows as a 
sextet of fine bright lines, i.e. form sharp and equally intense lines enclosed 
by two somewhat less sharp on the outside. On the other hand, the other D 
line (Dj) shows as a quartet, not of the weak-middled class, such as 4800 
of cadmium, but of the doublet type, that is, of the type which would result 
from the complete absorption of the central line combined with the sharp 
reversal of each of the side lines.” 144 Preston’s observation of what was 
subsequently called the “anomalous Zeeman effect”—in contrast to the 
“normal effect” which “conforms” to theory—was soon confirmed by 
Cornu, 145 who showed that the central component of the “triplet” of the 
Dj line in the sodium spectrum was really a doublet and that each compo¬ 
nent of the “triplet” of the D 2 line was also a doublet or, in other words, 
that the Dj line was a quadruplet and the D 2 line a sextuplet. From 1898 
on, when Lorentz attempted unsuccessfully to interpret 146 Preston’s and 
Cornu’s observations by generalizing his theory of the normal effect, until 
the end of the older quantum theory, the anomalous Zeeman effect re¬ 
mained an unsolved problem. 

With the introduction of improved optical apparatus with higher re¬ 
solving power it was soon realized, particularly through the work of 
Runge and Paschen in the early years of this century, that the normal 
triplet was the exception rather than the rule. It would lead us too far to 
describe in detail the progress in the study of the anomalous effect, such as 
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Preston’s discovery that analogous spectral lines of the same series, even if 
belonging to different elements, exhibit the same pattern (“Preston’s 
rule” 147 ) or Runge’s empirical law 148 according to which all Zeeman pat¬ 
terns known at the time could be expressed in terms of integral multiples 
of the ratio a/r , where a is the normal triplet interval and r an integral 
number called the Runge denominator. 149 

With the advent of Bohr’s theory of the hydrogen atom and the devel¬ 
opment of the older quantum theory it was, of course, an interesting and 
important question whether the new arsenal of concepts could also be 
applied to the Zeeman effect and if so, whether complete clarification 
could then be achieved. As mentioned in the beginning of the present 
section, Debye 150 and Sommerfeld 151 succeeded in interpreting the nor¬ 
mal Zeeman effect on the basis of the new conceptions. The equations of 
motion for the electron of the hydrogen atom in a homogeneous magnetic 
field H along the z axis are 



mz = 


dV 

dz 


where Vis the electrostatic potential — e 2 /r. The Hamiltonian is given by 
the expression 
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where the coordinates x , y , z and the momenta p x = mx — ( e/2c)Hy , 
Pi — m y + (e/2c)Hx, p 3 = mz are canonical variables. Transforming to 
spherical polar coordinates (point transformation), so that the new ca¬ 
nonical variables are r, 6 , <p,p r >Pe>P<p> Debye obtained the new Hamilto¬ 
nian 
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where the term proportional to c 2 has been neglected. Since <p is cyclic, 
Pep — a 3 (a constant) and the Hamilton-Jacobi equation reads 
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where co = eH /2mc and a x is the negative constant energy. The variables 
being separable, 
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where a\ is the third constant of integration. Integration of 
2 S r r t p r dr = m x h , 2 S d d 2 p d d0 = m 2 h, and = 2ira 3 = m 3 h yields 

(ire 2 3 l2m/yla l — coa 3 ) — 2ira 2 = m x h and 2ir(a 2 — a 3 ) = m 2 h. 


Calling m x + m 2 + m 3 = n, Debye obtained 


a x = 
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and, by Bohr’s frequency condition, 


2ir 2 me 4 ( 1 
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where the primed quantum numbers refer to the final orbit and the dou- 
bleprimed quantum numbers to the initial orbit. The magnetic field, as 
Debye thus could show, modifies the frequency of the emitted spectral line 
by a shift Av = A m 3 • (eH/Airmc) which accounts for the normal Lorentz 
triplet provided A m 3 = ± 1 or 0. That the classic Lorentz theory was able 
to arrive at exactly the same result was readily understood in view of the 
fact that the quantum of action h has canceled out. Sommerfeld, whose 
method of interpreting the Zeeman effect did not differ essentially from 
Debye’s, summarized the situation as follows: “In the present state the 
quantum treatment of the Zeeman effect achieves just as much as Lor- 
entz’s theory, but not more. It can account for the normal triplet... but 
hitherto it has not been able to explain the complicated Zeeman types.” 152 
Nor did Sommerfeld’s relativistic treatment of the Zeeman effect improve 
the situation. 153 


The magnetic quantum number m 3 , as it was called by Sommerfeld, 
coincided, of course, with the equatorial quantum number n x (see page 
97) which had been previously introduced by exactly the same definition. 
As n x = k cos a, the 77-dependent term in the energy expression, namely, 
— m^co^Tr, could be written as — ( kh /2ir) (e/2mc)H cos a or, in vec¬ 
torial notation, — (|x*H), which was known to be the interaction energy 
of a magnetic moment |x in a magnetic field H. That kh /2ir-e/2mc is 
indeed the magnetic moment generated by the circling of an electron of 
mechanical angular momentum p ^ = kh /2it followed easily, as is well 
known, from classical electrodynamics. The magnetic moment due to the 
electron’s orbital motion in the smallest of the Bohr orbits in the hydrogen 
atom ( k = 1), i.e., eh /4irmc , was called by Pauli 154 in 1920 the “Bohr 
magneton // 5 ”—in contradistinction to the “Weiss magneton,” which 
Weiss, 155 in the course of his investigations on ferromagnetics, had intro¬ 
duced in 1911 as the greatest common divisor of the magnetic moments of 
molecules. Although the “Weiss magneton” was approximately five times 
smaller than the “Bohr magneton,” it was the latter, and not the former, 
which was soon recognized as the natural unit of magnetic moment. 
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The problem of the anomalous Zeeman effect was early recognized as 
intimately related to the question concerning the origin of multiplet struc¬ 
tures of spectral lines. For, as spectroscopic evidence clearly showed, all 
components of a multiplet exhibit the anomalous Zeeman effect whereas 
singlets show only the normal effect. More generally, the type of the Zee- 
man pattern of a component evidently depended upon the multiplicity 
with which it was associated. Thus, the solution of one of these problems 
seemed to be the solution of both. In the course of these investigations new 
concepts had to be introduced which proved of fundamental importance 
for the further development of the theory. It is primarily for this reason 
that the study of multiplets and the anomalous Zeeman effect will be 
discussed in the sequel. 

The application of Ritz’s conception of terms to the alkali spectra and 
their line series led, as we have seen, to the conclusion that the terms of the 
sharp series were given by Rc/(n + s) 2 , those of the principal series by 
Rc/(n + p x ) 2 or Rc/(n + p 2 ) 2 , those of the diffuse series by Rc/(n + d) 2 
and so on. The so-called “Rydberg corrections” s,p x ,p 2 , d,.. .measured the 
deviation of each term from its analogue in the hydrogen spectrum. Ac¬ 
cording to Bohr’s theory the alkali spectra originate by a mechanism 
which differs from that of the hydrogen spectrum essentially only in one 
aspect: the optical electron (valence electron) moves in a field which, 
owing to the influence of the inner or core electrons, is not strictly Cou- 
lombian. Nonpenetrating orbits, that is, those of small eccentricity, had to 
be characterized by small Rydberg corrections since for them the condi¬ 
tions of the hydrogen atom were approximately satisfied. 

As Sommerfeld had shown, the orbits could be classified by two quan¬ 
tum numbers: the principal quantum number n( = n' + k), defining the 
main energy level, and the azimuthal quantum number k , defining a suble¬ 
vel of the main energy level. Whereas n was any positive integer, the 
associated k was restricted to the condition n^k> 0 and subject to the 
selection rule | A A: | = 1 . In accordance with the geometrical interpretation 
of k /n for the hydrogen atom as the ratio of minor to major semiaxis of the 
orbital ellipse, the orbit of greatest eccentricity—and hence with the 
greatest Rydberg correction, which on spectroscopic evidence was s —had 
to be assigned the smallest possible value of k , that is, k = 1. Vice versa, 
the greatest possible value of k , that is, k = n , had to be assigned to the 
term with the smallest Rydberg correction for a given n. Terms for which 
k = 1,2,3,4,... were called s,p, terms 156 in accordance with the first 

letter in the name of the corresponding (sharp, principal, diffuse, funda¬ 
mental) series. 

The question was now raised whether the two quantum numbers n and 
k were sufficient to characterize all known terms or energy levels. The 
introduction of the magnetic quantum number already showed that the 
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answer had to be negative. Moreover, it was even at that time well estab¬ 
lished that in the alkali spectra the principal series consisted of doublets 
with decreasing separation as n increased, whereas the lines of the sharp 
series were doublets of constant separation. Resolution with high-disper- 
sion spectrometers revealed that also the diffuse and fundamental series 
consisted of doublets with so-called satellites in addition to the chief lines. 
It was therefore obvious that the principal, diffuse, and fundamental 
terms themselves were doublets and the sharp terms simple or singlets. 
Similar spectroscopic evidence showed that the terms in the alkaline-earth 
spectra were singlets and triplets, with the exception of the s terms, which 
were always only singlets. 

Until 1922 only singlet, doublet, and triplet terms were known. The 
conception of mutiplicity was considerably extended in 1923, particularly 
through Catalan’s 157 investigations of the spectra of manganese and chro¬ 
mium. It became clear that complex terms up to octets do exist and that in 
such complex-term systems the multiplicity of levels for the sequence of s , 
p , d terms, etc., begins with 1 for the s terms and increases always by two 
units for the subsequent terms until the “permanent” level number, or 
“permanence number” (“Permanenznummer”) as Sommerfeld called it, 
was obtained which gave the system its name (e.g., “sextet system”). 
Thus, for a sextet, for example, the multiplicities of the s , p , d,f g terms 
were found to be, respectively, 1, 3, 5,6, 6. Groups of spectral lines which 
arise from transitions between such complex-term systems were called 
“multiplets,” as suggested by Catalan. 158 It was now also an established 
fact that atoms of the elements in each column of the periodic table have 
characteristic multiplicities: doublets for the alkali elements, singlets and 
triplets for the alkaline-earth elements, doublets and quartets for the ele¬ 
ments of the third column, and so on. 

Spectroscopic evidence now showed that in such multiplets—as, for 
example, in the group of lines corresponding to transitions between p and 
d terms—not all the lines are present which are allowed by Ritz’s combi¬ 
nation principle. This nonappearance of certain lines was for Sommer¬ 
feld 159 an important clue. For it suggested to him the operation of an as yet 
undetermined selection principle which prohibits the appearance of lines 
otherwise permissible. In order to discover this selection principle, he 
assigned tentatively to each spectral term a qualifying number n j which he 
called “inner quantum number” and which he later, following a sugges¬ 
tion by Bohr denoted by the letter j. Each term was thus characterized by 
three quantum numbers and, as Sommerfeld proposed, was denoted as, 
for example, in the case n = 5, k= l,y = 3, by the notation 5 s 3 . Analyzing 
the doublets and triplets of diffuse series, Sommerfeld found it possible to 
assign numerical values to the inner quantum number so as to establish a 
selection rule A/ = ±1 or 0. Lande 160 soon adduced additional evidence 
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to the effect that the transition j = 0 toy = 0 has to be excluded. In the case 
of odd multiplicities Sommerfeld assigned to s terms j = 0 for singlets, 
j= 1 for triplets,./ = 2 for quintets, etc. to p terms j = 1 for singlets, j = 2, 
1, 0 for triplets j = 3, 2, 1 for quintets, etc., to d terms j = 2 for singlets, 
j — 3,2,1, for triplets j = 4, 3, 2,1,0 for quintets, and so on. Once the full 
number of the multiplicity has been reached—which occurs for each se¬ 
ries with the appearance of j = 0—no further increase in the number of j 
values takes place. Sommerfeld was fully aware that the assignment of j 
values as proposed was not uniquely determined since the addition of an 
arbitrary constant did not violate the prescribed selection rule involving 
only differences in j. For even multiplicities Sommerfeld proposed a simi¬ 
lar scheme in which, however, as we shall see later on, the values of j had to 
be half integral. In any case, the selection rule did not so far throw any 
light on the physical significance of the inner quantum number, which, for 
the time being, was only a numbering device. 

Now, as a rule, all multiplet spectral lines exhibited an anomalous 
Zeeman effect. On the basis of van Lohuizen’s 161 investigations and the 
combination principle it was clear that this line splitting had its origin in a 
corresponding splitting of the multiplet energy levels. Thus it was obvious 
that further progress was contingent upon obtaining a successful explana¬ 
tion of the Zeeman splitting of multiplet levels. 

To this end Sommerfeld 162 and Lande 163 formulated, in agreement 
with the empirical data available at that time, what has been called the 
“magnetic-core hypothesis.” According to this hypothesis the atomic 
core, i.e., the nucleus and the inner (nonoptical) electrons, possesses an 
angular momentum of s units of h /2i t and a corresponding magnetic mo¬ 
ment. The latter produces an axially symmetric magnetic field whose 
symmetry axis coincides with the direction of the core’s angular momen¬ 
tum. In other words, the optical electron is subject to what might be called 
an internal Zeeman effect, its angular-momentum vector being allowed to 
assume only discrete inclinations with respect to the axis of the core. 

For the sake of historical accuracy it should be pointed out that an 
interpretation of doublets and triplets as due to an internal Zeeman effect 
had already been proposed in 1919 by Roschdestwensky (Rojdest- 
vensky). 164 On the basis of a theory, specially construed for this purpose, 
Roschdestwensky calculated the doublet separation of the first line in the 
principal series of lithium and found a value about five times too large. His 
theory, as he later realized himself, was incompatible with Bohr’s corre¬ 
spondence principle, of which he knew nothing because of the political 
situation at the time. 

In analogy with the vectorial representation of angular momenta in 
classical mechanics which Lande 165 had adopted for quantum theory as 
early as 1919, the angular-momentum vector R of the core (“Rumpf” 
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= trunk) was assumed to combine with the angular-momentum vector K 
of the optical electron to form a resultant vector J, the total angular mo¬ 
mentum of the atom. In accordance with classical mechanics R and K 
were assumed to precess around J and the latter, in the presence of an 
external magnetic field, was described as oscillating, like a gyroscope, 
about the direction of the field. The magnitude of J depended, of course, 
not only on the size of the components K and R but also on the angle 
between them. In fact, the different orientations of K with respect to the 
axis of R were correctly interpreted by Lande as the cause of the energy 
differences among the various sublevels of the same multiplet level. It was 
easy, of course, to see that the magnetic interaction between two magnetic 
moments is proportional to the scalar product of the respective angular 
momenta. Finally, Sommerfeld’s inner quantum number j was identified 
with the magnitude of J in units of h /277-and the physical significance of y 
as representing the total angular momentum of the atom was thereby 
established. 

The vector model, however, soon gave rise to serious difficulties. Ify, /, 
and s express, 166 in units of h/2ir , the magnitudes, respectively, of the 
vectors J, K and R the geometry of the vector diagram implied that 
| / — s\<j<, | / + s\. Thus, ify, /, and s are integers, the inner quantum num¬ 
ber y, for given values of / and s , can obviously assume 
r=l + s— \l — .s| + l different values. For /<.?, therefore, a multiplet 
level should be composed of 2/ + 1 sublevels and for l^s of 2s + 1 sublev¬ 
els. Since for l<,s the number of levels increases with increasing /, the 
multiplicities r =21 + 1 of s,p, d,... levels reveal immediately the magni¬ 
tude / of the corresponding angular-momentum vector K. In accordance 
with the accepted model of the atom one had to expect, of course, that for s 
levels / = 1, for p levels 1=2, etc. Spectroscopic experience, however, 
showed irrefutably that, contrary to these predictions, for s terms 
(k = 1), being singlets (i.e. r = 1), / = 0, and quite generally l = k — 1. 
For a given principal quantum number n the number /, it became clear, 
could assume the values 0,1,..., n — 1. The older quantum theory could 
never resolve this inconsistency. 

It should be clear that Sommerfeld’s assignment of quantum numbers 
was purely empirical and open to modifications. In fact, a different assign¬ 
ment which proved useful for many purposes and was widely adopted was 
proposed by Lande 167 as follows: R = ...for singlet, doublet, triplet,... 
systems, K = ^,^,...for s p , d,... states, and J, half integral for odd multi¬ 
plicities and integral for even multiplicities, as the quantum number be¬ 
longing to the resultant of the vectors associated with R and K. Obviously, 
R=s + ±tmdK=l +i. 168 

Lande 169 applied his scheme to his penetrating analysis of the anoma¬ 
lous Zeeman effect and arrived at an important, though not complete, 
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kinematical interpretation of this phenomenon. In the normal Zeeman 
effect, it will be recalled, the displacement of a resolved level was given by 
the expression m 3 hco/2ir t where co = eH/2mc and where, in accordance 
with the correspondence principle, a transition with A m 3 = 0 corre¬ 
sponds to a line component polarized parallel to the field (ir component) 
and A m 3 = + 1 to a polarization normal to the field (a component). 

It will also be recalled that the displacement or energy shift of a multi- 
plet line component in the anomalous Zeeman effect could be expressed 
according to Runge as an integral multiple of a/r , where r is an integer and 
a = h(o/2ir (see page 122). Since the resolution of a multiplet component 
led always (in the cases under consideration) to a symmetric pattern, a 
specification of one-half of the (transversal) pattern contained all the 
information needed. A Zeeman pattern could thus conveniently be de¬ 
scribed by writing the Runge denominator r under a horizontal line and 
above it, as numerators, separated by commas, the displacements of the 
various spectral components in units of a y i t components usually being 
enclosed in parentheses. 170 Now, just as in the discussion of the normal 
Zeeman effect of the hydrogen atom (page 123 ) m 3 was the projection of k 
on the direction of the external field, a quantum number m was introduced 
to characterize the projection of J on the direction of the field. In accor¬ 
dance with Lande’s notation the numerical value of m, which for even 
multiplicities had to be half integral and for odd multiplicities integral, 
was subject to the condition — |/—^|<m< \J— so that rn max 
= \ J — . Lande now assigned to each multiplet level appropriate m val¬ 
ues in agreement with the selection and polarization rules. Thus, in the 
case of the (four-line) Zeeman pattern of the Dj line of sodium, a line 
which according to Lande’s notation 171 originates by a transition from the 
level 2 S X to 2 P lf these terms were each given the two m values — \ and \\ 



Vertical arrows indicate ir components, 
oblique arrows indicate a components 


Similarly, the m values assigned to the 2 P 2 term (according to Lande’s 
notation) from which the transition associated with the D 2 line initiates 
were — \ 9 — By assigning to each unresolved multiplet level a set of 
integral space m values, Lande could account for the number of lines and 
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their polarizations in the Zeeman pattern in accordance with the selection 
rule A m = + 1 or 0. But in order to characterize also the displacement of 
the various Zeeman terms, an additional set of numbers was required. 
These numbers in units of a give the energy difference between the Zee- 
man level under discussion and the unresolved multiplet level. For the D 
lines of the sodium spectrum Lande found the following values: 



+ < 2 >» 4 
- 3 


+ (1). 3,5 
3 


That is, with 2 S X and m = — \ he associated — ], with 2 S X and m = \ he 
associated ], and so on. Analyzing this empirical array, Lande recognized 
that the new numbers assigned to each term were just the products of the 
m value by a constant number g which was independent of m but depend¬ 
ed on the numbers characterizing the unresolved level ( 2 S X or 2 P X or 2 P 2 in 
the case of the D lines). More specifically, Lande showed that the empiri¬ 
cal formula 



or 


g= 1 + 


J 2 + R 2 -K 2 

2J 2 


where J denotes yj(J+ \)(J — {) and similarly for R and AT, describes 
exactly the dependence of g , which he called the “splitting factor,” on R, 
K y and J. For 2 S X (R = \, K = J = 1), for example, g = 2, in agreement 
with the newly assigned numbers for this term. 

In his search for a theoretical explanation of the splitting factor Lande 
reverted to the vector diagram, replaced R, K y Jby R f K, J, and deduced 
from the preceding expression for g that g = 1 + [R cos(/, R )/J] . Since 
m=J cos (/, H ), where H indicates the direction of the external field, he 
obtained mg = J cos (/, H) + R cos (/, R) cos (/,/f), and as the product 
of the last two cosine functions is precisely the average of cos (R f H), 

mg = K cos (Ky H) + 2 R cos (R f H) , where bars over the cosine func- 
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tions denote average values over complete cycles. The result differed from 

mg = K cos ( K , H ) + R cos ( R , H ), which should have been expected 
were Larmor’s theorem applicable to R and K . 

Comparing the last two equations, Lande suggested that the appear¬ 
ance of the splitting factor g might be due to an anomalous ratio of the 
mechanical angular momentum of the core to its magnetic moment, to a 
ratio which is half the value to be expected on classical grounds. He pro¬ 
posed therefore to “modify” Larmor’s theorem but admitted the lack of 
any convincing theoretical explanation for such a “modification.” 172 Lan- 
de’s perspicacious analysis of the anomalous Zeeman effect thus led to a 
spectroscopic confirmation of the anomalous gyromagnetic ratio whose 
existence after Beck’s 173 careful experiments on the “Einstein-de Haas 
effect” could no longer be doubted. Like Beck, Lande envisaged a rotation 
of the core or nucleus, in addition to the Larmor precession, as a possibil¬ 
ity worthy of further investigation. 

Another obscure feature which Lande’s analysis involved was his in¬ 
troduction of the geometric means of angular momenta, the introduction 

of yj(J + for J, or in Sommerfeld’s notation 174 of v (J + 1 )j for 

y, of whose legitimacy Lande entertained serious doubts in view of their 
“irrationality in the scale of quanta.” However, he felt justified in publish¬ 
ing these results “because they may serve as a significant clue toward a 
final solution of the problem of the anomalous Zeeman splitting.” 175 


3.4 Exclusion Principle and Spin 

The Sommerfeld-Lande magnetic-core theory of the fine structure of 
spectral lines seemed to accord completely with the outcome of an impor¬ 
tant experiment which Otto Stem 176 proposed in 1921 as a test of Som¬ 
merfeld’s space quantization in a magnetic field. Since Dunoyer’s 177 pio¬ 
neer work molecular beams had been investigated to some extent, but 
mainly in connection with problems of gas kinetics. The development of a 
general method for the study of molecular beams and its application to 
atomic physics was due mainly to the work of Stem, which he carried out 
first in his laboratory at the University of Frankfurt, in collaboration with 
Gerlach, and subsequently at the Institute for Physical Chemistry in 
Hamburg, to whose chairmanship he was appointed in 1923. In the first 
experiment performed at the end of 1921, Stem and Gerlach 178 produced 
a narrow beam of silver atoms by evaporation in a heated oven and direct¬ 
ed the beam in a high vacuum (10 -4 to 10 -5 mm Hg) through collimating 
slits along the strong field gradient at the sharp edge of the pole piece of a 
DuBois electromagnet. Impinging on a plate of glass, the atoms formed a 
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thin deposit. A broadening of the deposit on the target was clearly detect 
able. In a second series of experiments, taking exposures of eight hours 
and more, Stem and Gerlach 179 established unequivocally that the atomic 
beam split in the magnetic field into precisely two beamlets. From the 
geometry of the apparatus and from a careful measurement of the gradient 
of the magnetic-field strength they could even confirm that each silver 
atom possesses a magnetic moment in the direction of the field of the 
amount of one Bohr magneton with an error of not more than 10 percent. 
Similar experiments were subsequently carried out also with other atoms, 
such as hydrogen, sodium, potassium, cadmium, thallium, zinc, copper, 
and gold. 180 The conspicuous absence of any silver atoms at the center of 
the deflection pattern in the Stem-Gerlach experiment was a striking 
confirmation of space quantization. For according to classical physics, 
which viewed atoms as elementary magnets susceptible of assuming a 
continuum of orientations, the pattern should have exhibited a Gaussian 
distribution with its maximum at the center, the position of the undeflect¬ 
ed beam. But in the Stem-Gerlach experiment no undeflected atoms were 
detectable at all! Moreover, if measurements of specific heats are ignored, 
it was the first experiment of a nonoptical nature with no reference to 
radiative energy to confirm a fundamental conclusion in quantum theory. 
Indeed, it was a triumph for quantum theory and Sommerfeld could right¬ 
fully declare: “With their bold experimental method Stem and Gerlach 
demonstrated not only the existence of space quantization, they also 
proved the atomistic nature of the magnetic moment, its quantum-theore¬ 
tic origin and its relation to the atomic structure of electricity.” 

Now, it was known from spectroscopic evidence that the ground state 
of silver is an s state belonging to a group whose multiplicity is 2. Hence, 
according to the Sommerfeld-Lande theory it followed from the equation 
2 = 2s + 1 that the angular momentum of the core and, since j a = 0, also 
the total angular momentum of the atom is \ in units of h /2i r. The compo¬ 
nents of the total angular momentum along the direction of the field were 
therefore + \ and — \ and the beam should split into exactly two beam- 
lets—in full agreement with the experiment. 

Yet in spite of this apparently 181 perfect agreement with theory, the 
Stem-Gerlach experiment, when scrutinized closely, led to grave difficul¬ 
ties. In fact, Einstein and Ehrenfest, 182 in a penetrating discussion of the 
possible mechanisms of alignment of the atoms in the field, pointed out 
that no satisfactory answer could be given to the question: “How do the 
atoms take up their orientations?” The high vacuum of the field eliminat¬ 
ed the possibility of explaining the process in terms of collisions and radia¬ 
tive energy exchanges could not be considered for this purpose since they 
require 10 9 to 10 11 sec whereas the alignment occurred in less than 10 -4 
sec. Only two alternatives were left: either “the real mechanism is such 
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that atoms never assume states in which they are not completely quan¬ 
tized” or “at rapid changes states occur which, with respect to the orienta¬ 
tion, violate the quantum rules.” Either assumption, they showed, leads to 
insurmountable conceptual difficulties, such as to a violation of the energy 
principle or to the conclusion that only systems capable of emitting radi¬ 
ation can be quantized. Although the deficiencies of proposed models of 
atoms and of suggested pictorial representations of atomic processes had 
been repeatedly stressed before, the present paper by Einstein and Ehren- 
fest was an early presentation of the conceptual impasse encountered in 
any attempt at forming a picture of the precise behavior of atoms. It is 
particularly interesting to note that this “writing on the wall,” years be¬ 
fore the advent of modem quantum mechanics, had as its coauthor Albert 
Einstein, who persistently refused to accept the basic tenets of this greatest 
departure from classical idealization and who, in particular, renounced its 
aspect of complementarity which disclaims the possibility of universally 
consistent models because of the need of contradictory descriptions. 

The theory which predicted space quantization thus, as we see, brought 
upon itself—through the very experimental verification of its predic¬ 
tion—insuperable difficulties. But more trouble lay ahead. 

The difficulties to be faced in solving the Zeeman effect, on the one 
hand, and the importance of a satisfactory explanation of precisely this 
phenomenon for the progress of theoretical physics, on the other, were 
already recognized by Wolfgang Pauli in his early days. After receiving 
the Ph.D. degree at the University of Munich, where he had studied under 
Sommerfeld, he spent the winter semester 1921/22 in Gottingen as an 
assistant to Bom. On the importance of this period of his future work 
Pauli wrote as follows: “A new phase of my scientific life began when I 
met Niels Bohr for the first time. This was in 1922, when he gave a series of 
guest lectures at Gottingen, 183 in which he reported on his theoretical 
investigations on the periodic system of elements. 1 shall recall only briefly 
that the essential progress made by Bohr’s considerations at that time was 
the explaining by means of the spherically symmetric atomic model the 
formation of the intermediate shells of the atom and the general properties 
of the rare earths. The question as to why all electrons for an atom in its 
ground state were not bound in the innermost shell had already been 
emphasized by Bohr as a fundamental problem in his earlier works. In his 
Gottingen lectures he treated particularly the closing of this innermost K 
shell in the helium atom and its essential connection with the two non¬ 
combining spectra of helium, the ortho- and parahelium spectra. How¬ 
ever, no convincing explanation for this phenomenon could be given on 
the basis of classical mechanics. It made a strong impression on me that 
Bohr at that time and in later discussions was looking for a general expla¬ 
nation which should hold for the closing of every electron shell and in 
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which the number 2 was considered as essential as 8, in contrast to Som- 
merfeld’s approach.” 184 

In the fall of 1922 Pauli accepted Bohr’s invitation to visit Copenhagen 
and to assist him in a German edition of his works. It was on this occasion 
that Pauli made his first serious effort to explain the anomalous Zeeman 
effect. In an address 185 delivered in 1945 at the Princeton Institute for 
Advanced Study, at a dinner given in his honor on the occasion of his 
Nobel Prize award (1945), Pauli referred to this visit and recalled how 
frustrated he had felt at that time. “The anomalous type of splitting was on 
the one hand especially fruitful because it exhibited beautiful and simple 
laws, but on the other hand it was hardly understandable, since very gen¬ 
eral assumptions concerning the electron, using classical theory as well as 
quantum theory, always led to the same triplet. A closer investigation of 
this problem left me with the feeling that it was even more unapproach¬ 
able. A colleague who met me strolling rather aimlessly in the beautiful 
streets of Copenhagen said to me in a friendly manner, ‘You look very 
unhappy’; whereupon I answered fiercely, “How can one look happy 
when he is thinking about the anomalous Zeeman effect?”’ 

In a paper 186 written in Copenhagen and in a sequel 187 written in Ham¬ 
burg, where in 1923 he accepted the position of Privatdozent, Pauli gener¬ 
alized certain results of Sommerfeld’s treatment of the Zeeman effect of 
the alkali and alkaline-earth spectra and derived the values of certain 
multiplet terms previously investigated by Lande. For this purpose he 
associated without theoretical justification 188 appropriate magnetic mo¬ 
ments with the angular momenta of the atom and succeeded in tracing the 
complete transition of the splitting from the case of weak fields to that of 
strong fields. However, convinced of an intimate relation between the 
theory of multiplet structure and the problem of the building up of the 
periodic system of elements, Pauli did not regard his work as final or 
conclusive. Referring 189 to his inaugural lecture at the University of Ham¬ 
burg, he recalled: “The contents of this lecture appeared very unsatisfac¬ 
tory to me, since the problem of the closing of the electronic shells had 
been clarified no further. The only thing that was clear was that a closer 
relation of this problem to the theory of multiplet structure must exist. I 
therefore tried to examine again critically the simplest case, the doublet 
structure of the alkali spectra. I arrived at the result that the point of view 
then orthodox—according to which a finite angular momentum of the 
atomic core was the cause of this doublet structure—must be given up as 
incorrect.” 

In fact, in the fall of 1924 Pauli advanced some striking arguments 
against this “ orthodox” theory or, as we called it, the magnetic-core 
theory, and showed its inconsistency with experience. Calculating 190 the 
relativistic mass variation of A'-shell electrons in homologous elements, 
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Pauli found that the ratio of the magnetic moment to the angular momen¬ 
tum, as obtained on the basis of the core theory, had to be multiplied by a 
correction factor 7 , namely the time-average value of (1 — v 2 /c 2 ) * taken 
over a complete orbit of the revolving electron. He found 
that for Ba(Z = 56)7 = 0.924, for Hg(Z = 80)7 = 0.817, for 
T1(Z = 81)^ = 0.812. As these results exemplify, the above-mentioned 
ratio (i.e., the inverse gyromagnetic ratio), according to the core theory, 
had to be a slowly decreasing function of the atomic number; hence Lan- 
de’s g factor and the Zeeman splitting had to depend on the atomic num¬ 
ber, a result contrary to experience. 

In view of the incontestable absence of such a dependence and in view 
of the completely unexplained exceptional position of the K shell which 
the core theory regarded as possessing nonzero momentum in contrast to 
all other closed shells, Pauli rejected the assumption that an inner closed 
shell participates at all, through its contribution to the core momentum, in 
the formation of the multiplet structure of optical spectra and their Zee- 
man splittings. “We incline therefore,” he declared , 191 “to doubt the cor¬ 
rectness of the contention that rare gas configurations partake in the com¬ 
plex structure and the anomalous Zeeman effect in form of core 
momenta.” Assuming that for a closed shell both the angular momentum 
and the magnetic moment vanish, Pauli concluded that in the case of the 
alkali atoms the angular momenta of the atoms as well as their energy 
changes in an external magnetic field are due to the valence electron alone. 
“In particular, the angular momenta of alkali atoms and their energy 
changes in an external magnetic field have to be considered as due essen¬ 
tially to the exclusive action of the optical electron which also has to be 
regarded as the source of the magnetomechanical anomaly. The doublet 
structure of the alkali spectra as well as the deviation from Larmor’s 
theorem are due, according to this view, to a peculiar, classically not 
describable two-valuedness (“eine eigentiimliche, klassisch nicht be- 
schreibbare Art von Zweideutigkeit”) in the quantum-theoretic proper¬ 
ties of the optical electron .” 192 

Pauli thus showed that the relativistically refined core theory was in¬ 
compatible with experience. In this context it should perhaps be recalled 
that ever since Pauli wrote, at the age of twenty, at Sommerfeld’s request, 
his comprehensive article on the theory of relativity for the Encyklop'adie 
der mathematischen Wissenschaften , 193 he was not only an expert on the 
theory and a master of its technique but also an ardent advocate of its 
fundamental tenets, which he unhesitatingly espoused. In fact, it was Pau- 

(r 

li’s strong conviction in the absolute validity of the theory of relativity 
which made him reject the “orthodox” core theory and thus prepared the 
way for the conception of the spin. 
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In the fall of 1924, just when Pauli advanced these arguments against 
the magnetic-core theory, Edmund C. Stoner’s classic paper on “The dis¬ 
tribution of electrons among atomic levels” 194 appeared and gave Pauli an 
important clue for the progress of his work. For a full understanding of the 
conceptual situation at that time some details concerning the theory of the 
shell structure of the atom have to be recalled. 

Since from optical spectra conclusions could be drawn only regarding 
the processes taking place in the exterior portions of the atom and since 
the study of the constitution of atoms had to start from the inner regions 
for its explanation of the building-up process, it was only natural that the 
theory of the shell structure had its beginning in the study of the x-ray 
spectra, the most important source of information concerning the internal 
structure of the atom. In 1911 Barkla, 195 in his work on x-rays, introduced 
the notion of K y L,...“series of radiation.” Two years later Moseley 196 
published the results of his famous experiments according to which “ev¬ 
ery element from aluminium to gold is characterized by an integer N 
which determines its X-ray spectrum...this integer N , the atomic number 
of the element, is identified with the number of positive units of electricity 
contained in the atomic nucleus...,” and according to which “the frequen¬ 
cy of any line in the X-ray spectrum is approximately proportional to 
A(N — b ) 2 , where A and b are constants.” 

When Kossel 197 soon afterward explained these experimental results 
on the basis of Bohr’s model of the atom, he connected Barkla’s energy 
levels with different groups of electrons in the atom, and since then the 
same letters K , L, A/,...have been used to denote the various groups or 
shells of electrons in the atom. According to Kossel’s theory the emission 
of characteristic x-ray lines is caused by the transition of electrons from 
higher quantum orbits to replace ejected inner electrons; his theory thus 
accounted for the fact that—contrary to optical spectra—characteristic x- 
ray lines manifest themselves only in emission and never in absorption 
spectra. It is obvious that Kossel’s theory was already based on the as¬ 
sumption, although not explicitly stated, that to every inner orbit or shell 
there corresponds a maximum number of electrons and that each shell 
becomes particularly stable when occupied by this maximum number of 
electrons. 

It was primarily Bohr who on the basis of his previous work succeeded 
in the early twenties in formulating an acceptable theory of the building 
up of the whole periodic system of elements. In agreement with Kossel’s 
ideas Bohr 198 proposed in 1921 the following distribution of electrons. 
Referring to the closed shells of the inert gases, he declared: “For the 
atoms of these elements we must expect the constitutions indicated by the 
following symbols: Helium (2J, Neon (2 X 8 2 ), Argon (2 1 8 2 8 2 )..., where 
the large figures denote the number of electrons in the groups starting 
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from the innermost one, and the small figures the total number of quanta 
characterising the orbits of electrons within each group.” 

Using all the evidence furnished by physics and chemistry at that time 
and employing half theoretical and half empirical methods, Bohr and 
others suggested more detailed schemes of distribution. The first attempt, 
of course, was to consider also the azimuthal quantum number k which, 
being subject to the condition 0 < &<«, would have led to the conclusion 
that K terms, the energy terms corresponding to n = 1 and k = 1, are 
single, L terms, corresponding to n = 2 and k = 1 or 2, are doublets 
(which, as we have seen, should remain unresolved in the case of a perfect¬ 
ly Coulombian field), M terms triplets, etc. The analysis of x-ray spectra, 
however, showed that L terms are triplets, M terms quintets, etc. It was 
therefore obvious that the two quantum numbers n and k were insufficient 
for a classification of the electrons in their orbits. Introducing the inner 
quantum number 199 y(( j + i)) which, for doublets, is equal either to k or 
to k— 1 (for 5 termsy is always equal only to k), Lande proposed in 1922a 
scheme 200 which assigned to each sublevel a unique set of the three quan¬ 
tum numbers n , k,j((n , / + 1 ,y + i)): 
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Thus, for example, for the electrons in sublevel L u : n = 2, k = 2, and 

7=1. 

For the distribution of the electrons with respect to the different shells 
the scheme proposed by Bohr and Coster 201 was still generally accepted. 
The table on page 137—it suffices for our purpose to give only its begin¬ 
ning—indicates this scheme. 

Meanwhile, however, careful studies of the x-ray absorption bands and 
precision measurements of the relative intensities of x-rays (e.g., of the 
K ai line, corresponding to the transition L lu ->AT, and of the K a2 line, 
corresponding to L u -+K) were carried out, and from these additional 
information could be inferred. The results of these experiments performed 
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primarily by de Broglie and Dauvillier , 202 proved to be incompatible with 
the assumed distribution. Full agreement between theory and experience 
was achieved only when Stoner proposed his well-known scheme of elec¬ 
tron distribution in the article which was so influential in Pauli’s work. 
Even prior to its theoretical vindication by quantum mechanics Stoner’s 
scheme was considered as an important improvement over Bohr’s. Som- 
merfeld, who immediately after its publication advocated its use both in 
his scientific papers and in the subsequent editions of his Atombau und 
Spektrallinien , characterized its advantages in 1925 as follows: “Being 
based on the incontestable experience as to number and order of x-ray 
levels and on the association of quantum numbers with these, Stoner’s 
scheme is much more trustworthy (“vertrauenserweckend”) than 
Bohr’s. It has an arithmetic rather than a geometric-mechanical charac¬ 
ter; without assuming any symmetry of orbits it exploits not some, but all 
available data of x-ray spectroscopy .” 203 

What, then, was Stoner’s new idea and “arithmetic” approach? Refer¬ 
ring to Lande’s classification, Stoner declared: “The number of electrons 
in each completed level is equal to double the sum of the inner quantum 
numbers as assigned, there being in the K,L,M,N levels, when completed, 
2,8( = 2 + 2 + 4),18(=2 + 2 + 4 + 4 + 6 ),...electrons. It is suggested 
that the number of electrons associated with each sub-level separately is 
also equal to double the inner quantum number .” 204 Thus, according to 
Stoner who called the azimuthal quantum number k x and the inner quan¬ 
tum number k 2 , the L x sublevel, characterized by n = 2, k x = 1, and 
k 2 = 1 ((« = 2 , / = 0 ,y = £)), contains lk 2 = 2 electrons, theL n sublevel, 
characterized by n = 2 , k x = 2 , and k 2 = 1 , contains 2 k 2 = 2 electrons, 
and the L m sublevel, characterized by n = 2, k x = 2, and k 2 = 2, contains 
2 k 2 = 4 electrons, in striking contrast to Bohr’s classification as proposed 
in his summarizing article “Linienspektren und Atombau ” 205 which ap¬ 
peared in 1923. A greater concentration of electrons in the outer sub¬ 
groups and, accordingly, the closing of the inner subgroups at an earlier 
stage distinguished Stoner’s from Bohr’s scheme. The total number of s.p. 
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tf,...electrons in each shell—in other words, the maximum capacity for 
such electrons in each shell—is given, as in Bohr’s scheme, by 4k x — 2 
(( 2 ( 2 / + 1 ))), in accordance with the equation 22k 2 = 4 k x — 2 . 

From the scientific as well as historical point of view it is interesting to 
note that Stoner’s suggestions, at least in part, had been proposed earlier 
on the basis of purely chemical considerations. In fact, six months before 
the appearance of Stoner’s paper, Main Smith published an article 206 in the 
Journal of the Society of Chemical Industry in which he not only declared, 
in opposition to Bohr’s original scheme, that “it is certain that the 2 quan¬ 
ta group or L level consists of at least 3 levels, the 3 quanta group or M 
level of 5 levels,” but also inferred from chemical evidence a detailed 
distribution of electrons similar to Stoner’s. Still, Stoner’s article, as com¬ 
pared with Smith’s, was not only more appealing to the mind of physicists 
who rarely read, if at all, the official organ of the Federal Council for Pure 
and Applied Chemistry (London), but also more important because it 
connected the distribution of electrons with the problem of multiplet 
structure. It was this aspect of Stoner’s paper which attracted Pauli’s 
attention—apart from the fact that it challenged Bohr’s review article, 
just referred to , 207 which happened to be one of the papers edited by Pauli 
for Bohr in Copenhagen. Most stimulating for Pauli was Stoner’s remark 
that “twice the inner quantum number does give the observed term multi¬ 
plicity as revealed by the spectra in a weak magnetic field... .In other 
words, the number of possible states of the (core + electron) system is 
equal to twice the inner quantum number, these 2j states being always 
possible and equally probable, but only manifesting themselves separately 
in the presence of the external field ,” 208 together with his previously 
quoted suggestion “that the number of electrons associated with each sub- 
level separately is equal to double the inner quantum number k 2 ,” or as 
Pauli later 209 formulated this statement: “For a given value of the princi¬ 
pal quantum number, the number of energy levels of a single electron in 
the alkali-metal spectra in an external magnetic field is the same as the 
number of electrons in the closed shell of the rare gases which correspond 
to this principal quantum number.” 

Indeed, Stoner’s remark led Pauli to a far-reaching conclusion. Ac¬ 
cepting Stoner’s suggestion that the number of stationary states in a mag¬ 
netic field for given n , k l9 and k 2 is just 2 k 2 ((2/ + 1)) and employing—in 
addition to the quantum numbers n, k u and k 2 —also a quantum number 
m x ((m y )) to represent the component of the angular momentum in the 
direction of the field, Pauli recognized that the shell structure of atoms 
finds its natural explanation if each possible orbit or state is labeled by the 
four quantum numbers n , k l9 k 2 , and m x ((«, /, j 9 /n y )), where 
— (k 2 — ^)<m!<(A: 2 — \) (( — j<mj<j )), and if it is assumed that only 
one electron is allowed to occupy each of these states. 
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In fact, on the basis of these assumptions, as Pauli pointed out, a suble¬ 
vel with given n , k l9 and k 2 contains 

(* 2 -i) + (*2-$) +1=2* 2 ((2/+1)) 

electrons, as suggested by Stoner in contrast to Bohr; a subgroup with 
given n and k x contains 

X 2* 2 = \k x — 2 (Y 2 (2 /+l)=2(2/+l) 

ki = k x 1 \\y= / — ^ 

electrons; and a shell with given n contains 

(4*, - 2) = 2/i 2 (Y £ 2(2/+ 1) = 2n 2 

k t = 1 \\l=0 

electrons, as suggested by Stoner in agreement with Bohr. 

Being constantly aware of Bohr’s fundamental problem as to why not 
all electrons, for the ground state of an atom, occupy the innermost shell, 
Pauli realized that his idea gains general importance if interpreted as oper¬ 
ating as a principle of prohibition. It explained the shell structure of the 
atom and the periods of the periodic system of the elements because it 
excluded the possibility that more than one electron occupies an orbit or 
state as defined above. 

One way to test the principle, Pauli immediately understood, was to 
consider the case of triplet s terms for two “equivalent” electrons (i.e., 
electrons with the same n and k x ) as in the alkaline-earth atoms. In this 
case the set of all four quantum numbers n 9 k l9 k 29 and m x ( (n 9 l 9 j 9 ntj )) 
would be the same for both electrons and hence the principle would de¬ 
mand the exclusion of such terms. Their actual nonappearance in na¬ 
ture—a fact previously unaccounted for—was for Pauli a striking evi¬ 
dence for the validity of his exclusion principle, which he formulated as 
follows: “There never exist two or more equivalent electrons in an atom 
which, in strong fields, agree in all quantum numbers n 9 k l9 k 29 m v \i there 
exists in the atom an electron for which these quantum numbers (in the 
external field) have definite values, this state is ‘occupied.’ ” 210 With re¬ 
gard to this formulation it should be recalled that the physical significance 
of * 2 was still, to say the least, problematic and that Pauli, at that time, was 
able to define the four quantum numbers only in the case of strong mag¬ 
netic fields when the coupling between the electrons was completely 
broken down. However, showing on the basis of thermodynamic reason¬ 
ing 211 that the number of states is invariant under a transition from strong 
to weak fields, Pauli was able to establish the general validity of the exclu¬ 
sion principle. In concluding his paper, Pauli expressed the hope that at 
some future date a more profound understanding of the fundamental prin¬ 
ciples of quantum theory would enable us to put the exclusion principle on 
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firmer foundations. For the time being, however, he had to admit that “no 
deeper motivation of the rule can be provided.” 212 

With the subsequent development of the formalism of quantum me¬ 
chanics and especially through the work of Heisenberg, 213 Dirac 214 — 
Dirac was the first to introduce a determinantal formulation for the accep¬ 
table wave functions, a formulation which after the appearance of Slater’s 
paper 215 was usually referred to as the “Slater determinant”—and 
Pauli, 216 it was recognized that the essence of the principle is the require¬ 
ment that all state functions, with the inclusion of the spin functions, of a 
system of similar particles, provided they are fermions, must be antisym¬ 
metric with respect to particle exchange. Thus it became clear that the 
principle of exclusion, like that of relativity, is not merely another 
theorem in physics but rather a general precept regulating the very formu¬ 
lation of physical laws. It was therefore no surprise that the range of its 
applications proved extremely wide: it was important for an understand¬ 
ing of the shell structure of atoms, of the energies of electrons in metals, of 
chemical valency, of ferromagnetism in crystals, and many other phe¬ 
nomena. 

With the exception of a study by Henry Margenau 217 philosophical 
analysis has so far shown little interest in the exclusion principle in blatant 
contrast to its inquisitiveness concerning the relativity principle. This in¬ 
difference, as Margenau mentioned, was certainly due to the fact that the 
exclusion principle was “bom amid a frenzy of factual discoveries,” that 
its results appeared less appealing and less paradoxical than those of rela¬ 
tivity, and that the problems solved by it were either too new or too old to 
stir excitement. An additional reason, however, which Margenau seems to 
have overlooked, was undoubtedly the fact that the invariance require¬ 
ment imposed by the relativity principle led to radical revisions of time- 
honored conceptions such as space and time, whereas the corresponding 
requirement of the exclusion principle, that is, the postulated antisym¬ 
metry of state functions (for fermions) seemed to have little bearing on 
the physical or philosophical significance of these functions per se. Mar- 
genau’s discussion on the relation between the Pauli principle, on the one 
hand, and the principle of causality, on the other, deserves still today 
serious consideration on the part of every physicist interested in the foun¬ 
dations of his science. 

In addition to the above-mentioned classification of the electrons in the 
atom by the quantum numbers n, k u k 2 , and m l Pauli suggested, as an 
alternative in the case of strong magnetic fields, replacing k 2 by a magnetic 
quantum number m 2 which represented, in appropriate units, the magnet¬ 
ic interaction energy of the atom with the external field. Pauli thus showed 
not only that it was possible to determine the maximum occupation num¬ 
ber of atomic shells but that one also could predict—with the help of the 
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total angular momentum in the direction of the field I/Mj and the total 
magnetic interaction energy with the external field —the type of mul- 

tiplets that can arise if the number of electrons for given n and k l is known. 
For alkali doublets the total angular momentum j of the atom (in units of 
h /2ir) is k 2 — \ or, since k 2 is k x or k x — 1 , either k l — \ or k l — i.e., in 

modem notation j = l + s 9 where s = + In addition, Pauli showed that 
in strong fields the component of the total angular momentum in the 
direction of the field is given by m l + m s (in modern notation) whereas 
the interaction energy (in units of fi B H) is given by m l + 2m s . Here m l 
represents the component of the orbital angular momentum / in the direc¬ 
tion of the field and m 5 = + Since Pauli, as we have seen, did not 
associate s, as had been done hitherto, with the core but rather with the 
valence electron itself and its “ classically not describable two-valued- 
ness,” it should have been an easy matter for him to ascribe to the electron 
an intrinsic angular momentum m s = ± \ (in units of h /2ir) and a corre¬ 
sponding intrinsic magnetic moment 2 m s (in units of fi B ). The historical 
question as to why Pauli was not the first to make these assumptions is the 
subject of a careful study by van der Waerden, 218 to which the interested 
reader is referred for further details. 

Nevertheless, just as Stoner’s paper on the distribution of electrons was 
instrumental for Pauli’s discovery of the exclusion principle, so, in turn, 
was Pauli’s paper on the exclusion principle influential for the discovery 
of the spin. In fact, the concept of the spinning electron was conceived— 
only to be rejected—by Kronig and was later rediscovered, so to speak, by 
Goudsmit and Uhlenbeck under the influence of Pauli’s work. 

In January, 1925, R. Kronig, at that time a traveling fellow of Colum¬ 
bia University, visited Tubingen, the Mecca of spectroscopists, to work 
with Lande, Gerlach, and Back . 219 Lande had just received “a long and 
very interesting letter” from Pauli with details on the exclusion principle 
and the assignment of the four quantum numbers. When Kronig was 
shown this letter and read Pauli’s remarks concerning the angular mo¬ 
mentum j and its possible values / + \ and / — it occurred to him imme¬ 
diately that the difference between the total angular momentum j and the 
orbital angular momentum / “might be considered as an intrinsic angular 
momentum” of the electron. In addition he assumed that this intrinsic 
angular momentum, which he interpreted dynamically as originating 
from a spinning motion of the electron about its own axis, is accompanied 
by a magnetic moment of one Bohr magneton. The same afternoon he put 
his new ideas to test by computing the doublet separations in the fine 
structure of alkali spectra. The size of these separations, as, for example, 
the 6 A between the two D lines in the sodium spectrum, was a puzzle at 
that time. Recalling that according to the Lorentz transformations for 
electromagnetic field components the electrostatic field in the rest frame 
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of the nucleus gives rise to a magnetic field in the instantaneous rest frame 
of the electron, he calculated the energy of interaction between the intrin¬ 
sic magnetic moment of the electron and the magnetic field which, of 
course, apart from a factor of proportionality turned out to be the scalar 
product of the orbital momentum and the spin vector. Kronig thus ob¬ 
tained a doublet splitting proportional to the fourth power of the effective 
nuclear charge or, more precisely, proportional to the product of the 
squares of the screened nuclear charges inside and outside the atomic 


core. His result was in full agreement with experience and also in accord 
with Lande’s semiempirical “ relativistic splitting rule” 220 but in striking 


contrast to the magnetic-core theory, according to which the splitting 


should increase with only the third power of the effective nuclear charge. 


Encouraged by this qualitative result, Kronig decided to put the new 
hypothesis also to a quantitative test by applying it to the interpretation of 
the simplest of all spectra, the spectrum of hydrogen or of hydrogenlike 
atoms. This, however, raised a serious conceptual difficulty because Som¬ 
merfeld’s (spin-free) relativistic treatment, as we have seen, accounted 
extremely well for the observed fine structure of such spectra. The as¬ 
sumption of an additional spin-orbit interaction could therefore only des¬ 
troy this agreement unless, for some reason or other, Sommerfeld’s rela¬ 
tivistic formula (3.4), page 98, could be differently interpreted as the 
result of a compensation between the spin-orbit coupling and the relativis¬ 
tic precession of the electronic orbit in its plane. Now, the observed viola¬ 
tion of the selection rule A k= + 1, as mentioned on page 98, and the 
meanwhile well-established selection rule A/ = + 1 or 0 for the inner 
quantum number as well as other arguments 221 suggested that the azi¬ 
muthal quantum number k in Sommerfeld’s formula (3.4) should be re¬ 
placed by j + a. Consequently, Kronig tried to find out whether the spin- 
orbit interaction does, in fact, compensate the relativistic precession in 
such a way that for every given j the two energy levels for / =j + \ and 
l=j coincide. For only such a degeneracy, Kronig presumed, could 
lead “to the same number and pattern of energy levels as predicted by 
Sommerfeld’s original treatment.” His attempts to obtain this agreement 
with experience, however, failed. The result of his calculation always dif¬ 
fered from the observed value by a factor of 2. In addition, a simple calcu¬ 
lation showed that a point on the surface of the spinning electron of classi¬ 
cal radius and intrinsic angular momentum h /Air would necessarily have 
a velocity many times greater than that of light. Finally, when also Pauli, 
Kramers, and Heisenberg with whom he discussed this matter, rejected 
his hypothesis, Kronig decided to leave his ideas unpublished. Thus, al¬ 
though the atom was still unhesitatingly regarded as a tiny solar system, 
Kronig’s suggestion of the spin was considered a rather strange and pecu¬ 
liar idea; Pauli reportedly called it “a flash of wit” (“ein ganz witziger 
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Einfall”). It seems that the main reason for the rejection of the spin was 
the relativistic prohibition of velocities greater than that of light. 

Although Kronig, as far as we know, was the first to apply the notions 
of spin and intrinsic magnetic moment to the interpretation of spectra, the 
very idea of a magnetic electron was not new. In fact, even ten years 
earlier, Parsons’ 222 speculative theory of magnetons anticipated the exis¬ 
tence of intrinsically magnetic elementary particles spinning about their 
own axis with an angular momentum such that “the velocity at the cir¬ 
cumference” of the particles is that of light. In 1921 A. H. Compton, who 
in collaboration with O. Rognley studied the intensity of x-rays reflected 
from crystal surfaces, found that this intensity is unaffected by a variation 
of the electronic distribution on the surfaces by the application of magnet¬ 
ic fields. On the basis of this evidence he thought it difficult to avoid the 
conclusion “that the elementary magnet is not the atom as a whole .... 
Since neither the molecule nor the atom gives a satisfactory explanation of 
the experiments, the view suggests itself that it is something within the 
atom, presumably the electron, which is the ultimate magnetic parti¬ 
cle.” 223 At the end of his paper Compton declared: “May I then conclude 
that the electron itself, spinning like a tiny gyroscope, is probably the 
ultimate magnetic particle.” Compton apparently never applied his hy¬ 
pothesis to the anomalous Zeeman effect, most probably because, as it 
seems, he was never particularly interested in spectroscopy as such. But 
even had he done so, it is unlikely that prior to Lande’s penetrating analy¬ 
sis a satisfactory understanding of this subject would have been obtained. 

But it was precisely *in spectroscopy where the need for an additional 
degree of freedom for the electron was most urgently felt and particularly 
so in the interpretation of the alkali doublets. For it was found that al¬ 
though their separations could formally be accounted for fairly well by 
Sommerfeld’s relativistic treatment, as mentioned previously, the relativ¬ 
istic mechanism underlying this approach could not possibly apply. For 
such a mechanism presupposed orbits of radically different eccentricities 
whereas, as was shown by Millikan and Bowen, there was abundant evi¬ 
dence that the same azimuthal quantum number has to be assigned to both 
components of these doublets. Millikan and Bowen thus declared early in 
1924: “The only way in which it appears to be possible to avoid the forego¬ 
ing serious difficulties is to throw overboard altogether the relativity ex¬ 
planation of the ‘relativity doublet’ and to assume that the amazing suc¬ 
cess of this relativity formula...is not due at all to differences in the shapes 
of elliptical and circular orbits, as postulated by the relativity theory of 
doublet separations, but that there is some other cause which by mere 
chance leads exactly to this relativity formula without actually necessitat¬ 
ing relativity conceptions.” 224 The need for an additional degree of free¬ 
dom is clearly expressed in their concluding remarks: “...some way must 
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be found to permit two orbits which have the same shape (azimuthal 
quantum numbers) but different orientations (inner quantum numbers) 
to possess widely different screening constants, i.e. widely different ener¬ 
gies. This would seem to require the introduction of a dissymmetry not 
heretofore contemplated into atomic models.” 225 

How this fourth degree of freedom of the electron was finally, in the 
wake of Pauli’s work, introduced by Goudsmit and Uhlenbeck, indepen¬ 
dently of Kronig’s investigations, and became part of the conceptual ap¬ 
paratus of modem physics has been told in detail by Uhlenbeck himself: 

“Goudsmit and myself hit upon this idea by studying a paper of Pauli, 
in which the famous exclusion principle was formulated and in which, for 
the first time, four quantum numbers were ascribed to the electron. This 
was done rather formally; no concrete picture was connected with it. To 
us this was a mystery. We were so conversant with the proposition that 
every quantum number corresponds to a degree of freedom, and on the 
other hand with the idea of a point electron, which obviously had three 
degrees of freedom only, that we could not place the fourth quantum 
number. We could understand it only if the electron was assumed to be a 
small sphere that could rotate.... 

“Somewhat later we found in a paper of Abraham (to which Ehrenfest 
drew our attention) that for a rotating sphere with surface charge the 
necessary factor two (in the magnetic moment) could be understood clas¬ 
sically. This encouraged us, but our enthusiasm was considerably reduced 
when we saw that the rotational velocity at the surface of the electron had 
to be many times the velocity of light! I remember that most of these 
thoughts came to us on an afternoon at the end of September 1925. We 
were excited, but we had not the slightest intention of publishing any¬ 
thing. It seemed so speculative and bold, that something ought to be 
wrong with it especially since Bohr, Heisenberg and Pauli, our great au¬ 
thorities had never proposed anything of the kind. But of course we told 
Ehrenfest. He was impressed at once, mainly, I feel, because of the visual 
character of our hypothesis, which was very much in his line. He called 
our attention to several points (e.g. to the fact that in 1921 A. H. Compton 
already had suggested the idea of a spinning electron as a possible explana¬ 
tion of the natural unit of magnetism) and finally said that it was either 
highly important or nonsense, and that we should write a short note for 
Naturwissenschaften and give it to him. He ended with the words ‘und 
dann werden wir Herm Lorentz fragen.’ This was done. Lorentz received 
us with his well known great kindness, and he was very much interested, 
although, I feel, somewhat sceptical too. He promised to think it over. 
And in fact, already next week he gave us a manuscript, written in his 
beautiful handwriting, containing long calculations on the electromagnet¬ 
ic properties of rotating electrons. We could not fully understand it, but it 
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was quite clear that the picture of the rotating electron, if taken seriously, 
would give rise to serious difficulties. For one thing, the magnetic energy 
would be so large that by the equivalence of mass and energy the electron 
would have a larger mass than the proton, or, if one sticks to the known 
mass, the electron would be bigger than the whole atom! In any case, it 
seemed to be nonsense. Goudsmit and myself both felt that it might be 
better for the present not to publish anything; but when we said this to 
Ehrenfest, he answered: ‘Ich habe Ihren Brief schon langst abgesandt; Sie 
sind beide jung genug um sich eine Dummheit leisten zu konnen !’ 99226 

This, then, is the story of how eventually, on November 20, 1925, the 
idea of the spinning electron was published in Die Naturwissenschaften in 
a short paper entitled “ Replacement of the hypothesis of the nonmechani¬ 
cal stress by a postulate concerning the intrinsic behavior of every single 
electron.” 227 As the title indicates, it was the authors’ intention to elimi¬ 
nate from atomic physics the hypothesis of the “nonmechanical stress” in 
terms of which Bohr 228 had attempted to explain the doublet structure. 
According to this assumption the coupling between the valence electron 
and the atomic core should not be regarded as a coupling between two 
periodicity systems but was supposed, owing to the “mechanically not 
describable stability conditions,” to impose upon the core a stress 
(“Zwang”) which, though not reducible to the effect of an external field 
of force, enables the core to assume two orientations rather than one. 
Referring explicitly to Pauli’s “classically not describable two-valued- 
ness” and his insistence on the use of four quantum numbers, Uhlenbeck 
and Goudsmit proposed to reinterpret the vector R , which hitherto was 
thought to represent the angular momentum of the core, as due to the 
“intrinsic rotation” (“eigene Rotation”) of the electron. In addition, to 
obtain full agreement between theory and experience, they postulated that 
“the ratio of the magnetic moment of the electron to its mechanical angu¬ 
lar momentum for the intrinsic rotation is twice as great as for the orbital 
motion,” referring the reader to Abraham’s classic calculations. 229 They 
also admitted, though only in a footnote, that their proposal would lead to 
a “peripheral velocity” of the electron which is many times that of light. 

In a letter to the editor of Nature 230 in which Uhlenbeck and Goudsmit 
reasserted their arguments for the introduction of the spin, Bohr added a 
few lines expressing his acceptance of the new ideas. With reference to his 
own hypothesis, now challenged by these authors, he declared: “In my 
article expression was given to the view that these difficulties were inher¬ 
ently connected with the limited possibility of representing the stationary 
states of the atom by a mechanical model. The situation seems, however, 
to be somewhat altered by the introduction of the hypothesis of the spin¬ 
ning electron which, in spite of the incompleteness of the conclusions that 
can be derived from models, promises to be a very welcome supplement to 
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our ideas of atomic structure.... Indeed, it opens up a very hopeful pros¬ 
pect of our being able to account more extensively for the properties of 
elements by means of mechanical models, at least in the qualitative way 
characteristic of applications of the correspondence principle.” 231 

Bohr’s optimistic reaction did not yet prepare the way for a general 
acceptance of the new hypothesis. A few weeks after the publication in 
Nature Kronig, in a reply, 232 reemphasized the main arguments against 
the acceptance of the spin hypothesis and pointed to a new additional 
difficulty. For Pauli 233 had already suggested in August, 1924, explaining 
the hyperfine structure in the Zeeman pattern of heavy atoms by the 
assumption of a nonvanishing nuclear angular momentum, an assump¬ 
tion which established the first connecting link between (atomic) spec¬ 
troscopy and nuclear dynamics. Since the study of the Zeeman effect 
clearly shows, Kronig argued, that electrons possess invariably a magnet¬ 
ic moment of one Bohr magneton, no matter in what orbit or in what atom 
they are found, one should expect that the same holds also for the electron 
when it is part of the nucleus. 234 If according to the spin hypothesis every 
electron had really an intrinsic moment of one magneton, Kronig con¬ 
tended, then the conclusion cannot be avoided that the magnetic moment 
of the nucleus is of the same order of magnitude, “unless the magnetic 
moments of all nuclear electrons just happened to cancel.” But in his view 
there is no place in the theory of the Zeeman effect for such an additional 
moment of the nucleus and the probability that in all atomic nuclei the 
magnetic moments just cancel each other is a priori almost nil. Kronig 
thus concluded his reply with the words: “The new hypothesis, therefore, 
appears rather to effect the removal of the family ghost from the basement 
to the subbasement, instead of expelling it definitely from the house.” 

The situation changed, however, in the spring of 1926, when Thom¬ 
as 235 and subsequently Frenkel 236 succeeded in correcting the error which 
led to the disturbing factor of 2 in the calculation of the doublet separa¬ 
tions. It was Thomas’s paper and the correct computation of the doublet 
separation which in March, 1926, converted even Pauli to accept the spin 
hypothesis—Pauli, who still in January, in a conversation with Bohr, had 
called it an “Ifrlehre” (“heresy”). From our modem point of view Pauli’s 
obstinate opposition to the notion of spin was fully justified as was his 
original conception of it as a “classically not describable two-valuedness.” 
For as he said himself: “Although at first I strongly doubted the correct¬ 
ness of this idea because of its classical mechanical character, I was finally 
converted to it by Thomas’s calculations on the magnitude of doublet 
splitting. On the other hand, my earlier doubts as well as the cautious 
expression ‘classically not describable two-valuedness’ experienced a cer¬ 
tain verification during later developments, as Bohr was able to show on 
the basis of wave mechanics that the electron spin cannot be measured by 
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classically describable experiments (as, for instance, deflection of molecu¬ 
lar beams in external electromagnetic fields) and must therefore be con¬ 
sidered as an essentially quantum-mechanical property of the elec¬ 
tron.” 237 

How Pauli 237a succeeded a year later in formulating within the frame¬ 
work of nonrelativistic quantum mechanics a consistent theory of the 
spinning electron with the use of his celebrated spin matrices and how his 
two-component equations—in the presence of some external field—soon 
turned out to be the nonrelativistic limit of Dirac’s famous relativistic 
equation of the electron 2375 is well known to every student of modem 
physics. What is not so well known, however, is the following. The so 
often repeated statement—suggested, probably, by this peculiar historical 
development—that “the spin is a purely relativistic effect” is a misconcep¬ 
tion. For it can be shown 237c that a consistent theory of spin \ with the 
correct value of the intrinsic magnetic moment can be established without 
ad hoc assumptions within the framework of Galileo-invariant, and not 
necessarily Lorentz-invariant, wave equations, though with the exclusion 
of spin-orbit interactions and the Darwin term (which thus are truly rela¬ 
tivistic effects). Pauli’s description of the spin “as an essentially quantum- 
mechanical property” remains true, however, as can easily be shown by 
taking the limit h -►O. 

As we have seen, the notion of the spin of the electron raised, within the 
framework of the older quantum theory and its revised vector model, 
serious conceptual difficulties—but it worked. It was typical, in this re¬ 
spect, for many other notions and relations in terms of which the older 
quantum theory described atomic processes surprisingly well without, 
however, being able to provide strictly logical proofs. In fact, the major 
part of the descriptive contents of the older theory could be carried over 
without difficulties into the new quantum mechanics as a way of describ¬ 
ing by models, though only to a limited extent, the rigorous results of the 
theory. 

Reviewing the development of the older quantum theory, we recall that 
it was essentially a restriction of classical dynamics allowing only for 
orbits or states which satisfy the quantum conditions. These quantum 
conditions, although to some extent deprived of their mysterious nature 
by Ehrenfest’s adiabatic principle, were never given a deeper logical justi¬ 
fication. In addition, the range of their applicability, it turned out, was 
severely limited. The following considerations will clarify this point. In 
analogy to the methods of classical dynamics in its treatment of the many- 
body problem also the older quantum theory developed, mainly through 
the work of Bohr, Bom, Brody, and Epstein, a theory of perturbations. At 
first such an approach seemed rather futile since, contrary to celestial 
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mechanics, where the attraction of the central body has a much higher 
order of magnitude than the perturbations by the planets among them¬ 
selves, the repulsive forces among the electrons are of the same order of 
magnitude as the force of attraction exerted on them by the nucleus. These 
presumably insurmountable difficulties, however, were overcome precise¬ 
ly by that feature of quantum theory by which it differs from classical 
dynamics, namely, the quantum conditions. Yet in spite of the successful 
development of an intrinsically consistent theory of perturbations, the 
results of its applications were frustrating as the study of the simplest 
possible case, the three-body problem of the neutral helium atom, clearly 
showed. Bohr, in fact, abandoned early all hope for a strictly analytic 
study of polyelectronic systems. His very conception of “penetrating or¬ 
bits” which he used to obtain qualitative results was, strictly speaking, the 
first renunciation of his own conception of atomic dynamics as a theory of 
periodic systems. The penetrating orbit could no longer be treated as a 
periodic process because of the strong interaction with the atomic core. 
Moreover, Bohr’s introduction of the conception of “penetrating orbits” 
initiated the treatment of the atom by models, a method which in its 
further elaborations by Sommerfeld and particularly by Lande gradually 
lost contact with the theory of periodic or conditionally periodic motions. 
The necessity of introducing half odd-integral quantum numbers, the ap¬ 
pearance oil = k—\ instead of the theoretically required k, the still ob¬ 
scure relationship between the spin-orbit interaction and the relativity 
correction in the case of the hydrogen atom, and above all the absence of 
any definite methodology—all these deficiencies were a clear indication 
that the older quantum theory could be regarded as only a first step to¬ 
ward an as yet unknown satisfactory theory. 

Even conservative physicists such as Mie agreed that fundamental revi¬ 
sions of classical dynamics were imperative. In an address delivered in 
1925 in Freiburg, Mie 238 explained that quantum theory owes its origin— 
and its difficulties—to the attempt “to connect the physics of matter with 
the physics of the ether.” He continued: “The problem of matter cannot be 
expected to find its solution until the hitherto accepted Hamiltonian prin¬ 
ciple is replaced by a principle which can serve as the foundation of a 
theoretical edifice of both a new mechanics and a new physics of the 
ether.” 

If Mie meant to say that the foundations of the new mechanics would 
have to be a coalescence of optics and particle dynamics, his prediction 
was certainly correct. In fact, the very term “quantum mechanics,’’which 
made its first appearance as a terminus technicus in the title of Bom’s 
paper “On quantum mechanics” 239 in 1924, had already been coined to 
mean a “quantum mechanics of coupling” or of interaction and was em¬ 
ployed by Bohr to denote his attempt to apply Kramers’s treatment 240 of 
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the interaction between light waves and electrons to the mutual interac¬ 
tion between the electrons themselves. It was to denote a theory of me¬ 
chanics consistent with Kramers’s dispersion formula. The formation of 
the new quantum mechanics, with respect to both its conceptual contents 
and its methodological procedures, did indeed originate in the study of 
optical problems. Our analysis of the conceptual development of modem 
quantum mechanics has therefore to begin with a discussion of these prob¬ 
lems. 

Before we start this discussion in the following chapter, let us make a 
few remarks concerning the term “quantum mechanics.” In his Nobel 
Prize Lecture, delivered in Stockholm on December 11, 1954, on “The 
Statistical Interpretation of Quantum Mechanics” 241 Bom stated that his 
1924 paper 242 introduced “probably for the first time” (“wohl zum ersten 
Male”) the term “ quantum mechanics” (“ Quantenmechanik”), as we 
have just mentioned. This statement was repeated by Amim Hermann in 
his biography of Bom when he said that “the new word appeared for the 
first time in Bom’s paper of 1924... .” 243 It should be noted, however, that 
the almost identical expression “mechanics of quanta” had already been 
used earlier. Thus, for example, H. A. Lorentz, in his address “The Old 
and the New Mechanics,” 244 delivered at the Sorbonne on December 10, 
1923, at the occasion of the Jubilee Celebration of the Societe Frangaise de 
Physique, declared: “All this has great beauty and extreme importance, 
but unfortunately we do not really understand it. We do not understand 
Planck’s hypothesis concerning oscillators nor the exclusion of nonsta¬ 
tionary orbits, and we do not understand, how, after all, according to 
Bohr’s theory, light is produced. There can be no doubt, a mechanics of 
quanta, a mechanics of discontinuities, has still to be made.” 245 These 
words have been quoted not only because of the terminological issue but 
also because Lorentz’s statement reflected a general feeling that was pre¬ 
vailing at the time among many theoreticians and expressed their assess¬ 
ment of the situation of theoretical physics prior to the advent of quantum 
mechanics. 

In the context of our present discussion of Bom’s introduction of the 
term “quantum mechanics” we wish to make a few remarks on our use of 
this term in the present text. We shall call “quantum physics” that branch 
of experimental and theoretical physics which studies quantum phenome¬ 
na that is, physical processes which involve Planck’s constant h, and 
“quantum theory” its theoretical part. Quantum theory thus comprises 
“quantum field theory” and “quantum mechanics.” The latter, as used in 
our work will denote the general abstract theory irrespective of its particu¬ 
lar representation such as matrix mechanics or wave mechanics. These, 
consequently, will be regarded as special formulations of quantum me¬ 
chanics. 
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Whether the suggested terminology, ultimately a matter of definition, 
is appropriate depends on the particular viewpoint adopted. Those who, 
like de Broglie 246 and his school, consider undulatory processes as of pri¬ 
mary importance would probably ascribe to wave mechanics a more pref¬ 
erential status. Those who, like Lande or, on different grounds, Mackey, 
view quantum mechanics as essentially a development of statistical me¬ 
chanics may even object to the term itself, as, indeed, Mackey did when he 
wrote: “We should like to emphasize that in quantum mechanics the 
quantum rules are a deduction and not a fundamental feature of the for¬ 
mulation of the theory. In this sense the name quantum mechanics is 
unfortunate. It is basically a revision of statistical mechanics in that one 
studies the change in time of probability measures but no longer supposes 
that the motion of these measures is that induced by a motion of points in 
phase space. The laws of quantum mechanics place certain restrictions on 
the possible simultaneous probability distributions of various observables 
and give differential equations which may be integrated to show how they 
change in time. Everything else is deducible from these laws.” 247 
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CHAPTER 4 

The Transition to 
Quantum Mechanics 


4.1 Applications of Quantum Conceptions to Physical Optics 

Since the inception of quantum theory—in fact, since Planck’s “compro¬ 
mise” 1 prior to his discovery of the elementary quantum of action—the 
conflict between the undulatory and the corpuscular conceptions of radi¬ 
ation became more and more acute, particularly so, as we have seen, under 
the impact of Einstein’s ideas on light quanta and energy fluctuations. 
Initially regarded as merely a working hypothesis or, as Einstein called it 
apologetically “a heuristic viewpoint,” 2 the quantum-corpuscular view 
on the nature of radiation became eventually a serious challenge to the 
undulatory theory of light. It proved particularly important for the inter¬ 
pretation of processes in physical optics which involved interactions of 
radiation and matter. 

It was primarily Arthur Holly Compton’s experiment which put the 
quantum-corpuscular view on firm empirical foundations. In fact, Comp¬ 
ton contributed to the acceptance of the quantum view of radiation just as 
much as Fresnel had done a century earlier for the acceptance of the 
classical wave theory of light. It should be noted, however, that whereas 
Fresnel’s famous memoir 3 of 1818 was a theoretical vindication of 
Young’s experimental work, Compton’s contribution was an experimen¬ 
tal confirmation of Einstein’s theoretical conclusions. In view of its deci¬ 
sive importance for the basic conceptions of modem quantum mechanics 
the Compton experiment will be discussed in greater detail. 

As Bartlett 4 recently pointed out, the qualitative aspects of the basic 
facts underlying the Compton effect had been well known from the inves¬ 
tigations of scattered y rays long before Compton began his study of the 
scattering of x-rays. In fact, as early as 1904 Eve 5 investigated the scatter¬ 
ing of y radiation emitted from radium and found that the secondary rays 
differ in their power of penetrability from the primary radiation. That 
secondary radiation has much greater coefficients of absorption than pri¬ 
mary radiation was shown in 1908 by Kleeman. 6 Similar results were 
obtained by Madsen 7 and Florance. 8 The latter studied the scattered radi- 
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ation by means of an electroscope which was moved along an arc at con¬ 
stant distance from the scatterer. The experiments revealed that the wave¬ 
length of the secondary radiation depends on the angle of scattering. That 
also for x-rays scattered radiation is less penetrating than primary radi¬ 
ation was shown by Sadler and Mesham 9 in 1912. Since the measurement 
of absorption coefficients was at that time the only technique used for 
investigating the quality of these radiations, no accurate quantitative re¬ 
sults could be obtained. 

The theory of high-frequency radiation scattering, prevailing at that 
time, was developed primarily by J. J. Thomson 10 on the basis of Max¬ 
well’s electrodynamics. According to this theory, whenever an electro¬ 
magnetic pulse traverses an atom, it sets the electrons into forced oscilla¬ 
tions and the radiation emitted by these electrons in virtue of their 
accelerations is identified with the scattered radiation. As a simple calcu¬ 
lation shows, 11 the intensity I e observed at a distance r from the scatterer 
and at a scattering angle 6 relative to the direction of the primary beam of 
intensity / is given, per unit volume of matter, by the equation 


Ine 4 ( 1 + cos 2 0) 
2 m 2 r 2 c 4 


(4.1) 


where n is the number of electrons per unit volume, m and e the mass and 
charge of the electron, and c the velocity of light. Thus, the energy scat¬ 
tered is independent of the wavelength of the incident rays. The formula 
agreed fairly well with experiments on soft x-rays and not too small scat¬ 
tering angles, as shown, for example, by Barkla and Ayres. 12 

The first who questioned the validity of Thomson’s theory was Gray, 
who, incidentally, was also one of the first to establish that y rays and x- 
rays are of the same nature. Reexamining the experimental results ob¬ 
tained by Madsen and Florance, Gray suggested as early as 1913 “that the 
usual explanation of the scattering of gamma rays cannot be quite cor¬ 
rect ... as it appeared that when the intensity of the primary rays was 
diminished by lead, the softer scattered rays were not cut down so quickly 
as one would expect.” 13 Gray also emphasized that “the quality and quan¬ 
tity of the scattered radiation is approximately independent of the nature 
of the radiator.” 14 In 1917 Ishino 15 showed that the total secondary y- 
radiation is much less than Thomson’s formula predicted. 

When Gray 16 resumed his experiments on the scattering of y rays of 
radium in 1920, he recognized the inadequacy of merely comparing the 
qualities of the primary and scattered radiations by absorption measure¬ 
ments. Referring to his own measurements, he stated that “the results we 
have obtained would be explained if we could always look on a beam of x- 
or gamma-rays as a mixture of waves of definite frequencies, and if rays of 
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a definite frequency were altered in wave-length during the process of 
scattering, the wave-length increasing with the angle of scattering,” 17 It is 
interesting to note that the Braggs, 18 in their classic paper on x-ray diffrac¬ 
tion, had already given in 1913 a detailed description of the well-known x- 
ray spectrometer which carries their name—they referred to it as “an 
apparatus resembling a spectrometer in form, an ionisation chamber tak¬ 
ing the place of the telescope”—and that so far no use of it had been made 
for the study of scattering. 

That the intensity of scattered x-rays of very short wavelength is con¬ 
siderably less than predicted by the Thomson formula (4.1) has mean¬ 
while been confirmed irrefutably by numerous experiments. All attempts 
at explanations on the basis of the classical electron theory—for example, 
Schott’s 19 introduction of constraining and damping forces on the elec¬ 
trons or Compton’s 20 theory of the “large electron” whose dimensions are 
comparable with the wavelength of hard y rays and may therefore give rise 
to interference effects between the rays scattered from the different parts 
of the electron—failed to account for the observed facts. A radically new 
approach was necessary and this approach was worked out by Compton. 

Compton, like Gray, began his work with the study of y -ray scattering. 
As his early paper on “The degradation of gamma-ray energy” 21 showed, 
he was interested not only in the experimental aspects of “the nature and 
the general characteristics of secondary gamma rays” but also in an expla¬ 
nation of “the mechanism whereby comparatively soft secondary radi¬ 
ation is excited by relatively hard primary radiation.” From the beginning 
of his work, as we see, the search for a mechanism of the scattering was at 
the forefront of his investigations. In 1921 Compton 22 employed for the 
first time the Bragg spectrometer “in combination with a recording device 
to register the intensity of the secondary radiation for different angles of 
the ionization chamber.” This made it possible for him to realize that “in 
addition to scattered radiation there appeared in the secondary rays a type 
of fluorescent radiation, whose wavelength was nearly independent of the 
substance used as a radiator, depending only upon the wavelength of the 
incident rays and the angle at which the secondary rays are examined,” 23 
Compton found that the wavelength of the rays was increased in the scat- 

o 

tering process by about 0.03 A over the wavelength of the primary ray. 
After repeated measurements with the x-ray spectrometer Compton cor- 

o 

rected this result to 0.02 A. At first he attempted to account for this 
observed softening of the secondary radiation within the framework of 
classical conceptions and thus took recourse to the Doppler effect. He 
found, however, that in order to account for the observed shift, all of the 
electrons in the scatterer would have to be moving in the direction of the 
incident radiation with a velocity of about half that of light. Since such an 
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assumption was obviously unacceptable, Compton concluded that the 
classical theory was irreconcilable with experience. 

At that time Compton was a member of the Committee on X-ray Spec¬ 
tra of the Division of Physical Sciences in the National Research Council. 
The Committee, which under the chairmanship of William Duane (Har¬ 
vard University) included Bergen Davis (Columbia University), A. W. 
Hull (General Electric Research Laboratories), and D. L. Webster (Le- 
land Stanford University), was just about to publish a report on this field 
of research. Duane objected to including Compton’s results in this report, 
claiming that the evidence was inconclusive. Only owing to the insistence 
of Hull, Compton’s results were published in the report. 24 

Since Thomson’s theory and the Doppler effect did not explain the 
experimental results, Compton examined “what would happen if each 
quantum of x-ray energy were concentrated in a single particle and would 
act as a unit on a single electron.” 25 On the basis of this quantum-corpus¬ 
cular assumption and on the basis of the conservation principles of energy 
and momentum, Compton derived in a way well known to every student 
of modem atomic physics his famous formula for the change of wave¬ 
length 


A A = 



(1 — cos 9) 


(4.2) 


where 9 is the scattering angle, his formula for the energy of the recoil 
electron 


•Ekin = hv 


2a cos 2 <p 


, - (4.3) 

(1 + a) — a cos <p 

where <p is the angle of recoil and a = hv/mc 2 , as well as the relation 
between the scattering angle and the angle of recoil 


cot <p = — (1 + a)tan \9 (4.4) 

Using the K a line of molybdenum as primary radiation and graphite as 
scatterer, Compton measured repeatedly the angle dependence of the 
scattered radiation in his laboratory at Washington University in St. 
Louis and found full agreement with formula (4.2). He thus felt justified 
in making the first report on his new interpretation of x-ray scattering at 
the Washington meeting of the American Physical Society, which took 
place on April 20 and 21, 1923. 26 One month later the Physical Review 
published his celebrated paper “A quantum theory of the scattering of x- 
rays by light elements,” 27 in which he declared: “The present theory de¬ 
pends essentially upon the assumption that each electron which is elec¬ 
tive in the scattering scatters a complete quantum. It involves also the 
hypothesis that the quanta of radiation are received from definite direc- 
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tions and are scattered in definite directions. The experimental support of 
the theory indicates very convincingly that a radiation quantum carries 
with it directed momentum as well as energy. 28 

For the sake of historical accuracy it should be noted that about the 
same time also Peter Debye, then at Zurich, suggested, like Compton, that 
x-ray scattering can be explained by the assumption that an x-ray quan¬ 
tum loses energy by collision with an electron. In fact, Debye’s paper 29 
containing this suggestion had already been received by the editor of the 
Physikalische Zeitschrift in March 1923, one month before Compton 
made his first report. 

Compton’s results became the subject of passionate discussions at the 
December, 1923, meeting of the American Physical Society in Washing¬ 
ton. Compton recalls that “having frequently repeated the experiments I 
entered the debate with confidence, but was nevertheless pleased to find 
that I had support from P. A. Ross of Stanford and M. de Broglie of Paris, 
who had obtained photographic spectra showing results similar to my 
own. Duane at Harvard with his graduate students had been able to find 
not the same spectrum of the scattered rays, but one which they attributed 
to tertiary x-rays excited by photoelectrons in the scattering material. I 
might have criticized his interpretation of his results on rather obvious 
grounds, but thought it would be wiser to let Duane himself find the 
answer. Duane followed up this debate by visiting my laboratory (at that 
time in Chicago) and invited me to his laboratory at Harvard, a courtesy 
that I should like to think is characteristic of the true spirit of science. The 
result was that neither of us could find the reason for the difference in the 
results at the two laboratories, but it turned out that the equipment that I 
was using was more sensitive and better adapted than was Duane’s to a 
study of the phenomenon in question.” 30 

During the second week of August, 1924, the British Association for 
the Advancement of Science had its 92d annual meeting in Toronto. A full 
afternoon (Friday, August 8) was dedicated to a discussion on x-ray 
scattering. 31 Compton defended his theory while Gray and Duane re¬ 
ferred to experiments performed by Allison, Clark, Stifler, and others as 
evidence against Compton’s conclusions. As Compton reports: “The re¬ 
sult was inconclusive. It was summarized by Sir C. V. Raman by this 
statement to me privately after the meeting. ‘Compton,’ he said, ‘you are a 
good debater, but the truth is not in you.’ Nevertheless, it seems to have 
been this discussion that stimulated Raman to the discovery of the effect 
which now bears his name. Duane followed up this meeting by a new 
interpretation of the change in wavelength which he attributed to what he 
called a ‘box’ effect, explaining that surrounding the scattering apparatus 
with a lead box had in some way altered the character of the radiation. 
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This interpretation I answered by repeating the experiment out-of-doors 
with essentially the same results, and at the same time Duane and his 
collaborators in a repetition of their own experiments began to find the 
spectrum line of the changed wavelength in accord with my collision 
theory. At the next meeting of the American Physical Society they report¬ 
ed a very good measurement of this change in wavelength.” 32 

In fact, during the Washington meeting (December 29 to 31, 1924) 
referred to by Compton, his experimental results and his hypothesis final¬ 
ly found unanimous consent. Ross and Webster 33 declared: “Duane’s ter¬ 
tiary peak has not been found in any of the experiments”; Compton, Bear¬ 
den, and Woo 34 refuted the “box elfect,” stating that “we thus find no 
measurable effect due to an enclosing box”; and, most important of all, 
Allison and Duane 35 admitted in their paper on “The Compton effect” 
that “Well marked broad peaks appeared on the curves, the positions of 
which corresponded approximately with Compton’s equation.” 

During the years 1921 to 1924, a period in which, as we have seen, the 
study of x-ray scattering led to the inescapable result of a quantum-cor¬ 
puscular hypothesis, it became clear that this hypothesis could be applied 
also to other optical phenomena which till then had been regarded as 
irrefutable evidence in favor of the undulatory conception of light. 

In 1921 Emden 36 showed that the assumption of quanta of light ac¬ 
counted not only for the Stefan-Boltzmann law, Wien’s displacement law, 
and Planck’s radiation law, but also for the Doppler effect, one of the most 
typically undulatory phenomena in optics. A more rigorous derivation of 
the Doppler effect on the basis of the quantum-corpuscular view was 
given in June, 1922, by Schrodinger, 37 who, incidentally, attributed phys¬ 
ical reality exclusively to waves after the advent of wave mechanics. The 
elementary Doppler formula could indeed be easily obtained on these 
assumptions, as the following rough calculation indicates. If it is assumed 
that the light quantum hv is emitted in the direction of the motion of 
the atom of mass m and velocity v x before the emission and velocity v 2 
after the emission, the principle of energy conservation entails that 
hdv = (m/2) (Uj 2 — v 2 2 ) and that of momentum conservation that mv x 
= mv 2 + ( hv/c ). A combination of these two equations leads to the re¬ 
sult that the relative frequency shift dv/v is given by (v x -t- v 2 )/2c, which 
is Doppler’s formula, a positive velocity corresponding to a motion to¬ 
ward the observer and resulting in an apparent frequency increase. 

Soon Maxwell’s equations themselves, the very foundation of the un¬ 
dulatory theory, became the target of similar attacks. Only a month after 
the publication of Schrodinger’s paper Oseen 38 demonstrated that Max¬ 
well’s equations admit solutions which agree to an arbitrary degree of 
approximation with the corpuscular view in so far as the emitted energy 
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can be regarded as being concentrated within an extremely narrow solid 
angle. Oseen showed that this angle can be taken so small that, up to a 
fractional part of the order of 10 “ 10 , the total energy emitted from a point 
source on the earth converges at a distance like that of Sirius on an area not 
larger than that of a small coin (“Fiinfpfennigstiick”). 

The next undulatory property to yield to corpuscular interpretations 
was the diffraction by an infinite (reflecting) line grating as shown, cur¬ 
iously enough, by Duane when he was still a staunch opponent of Comp¬ 
ton. Taking as x axis a direction in the plane of the grating perpendicular 
to its lines, Duane 39 treated the translational motion of the grating along 
this axis under the impact of the incident quanta of light as a periodic 
motion with a period defined by the translation over the grating constant 
a. Applying the quantum condition SoP dx = nh , Duane concluded that a 
transfer of momentum p to the grating occurs only in integral multiples of 
h /a . Since the energy loss of the light quantum hv f impinging at an angle 
of incidence 6 , can be neglected in view of the large mass of the grating, the 
quantum will undergo no change in its frequency when reflected at an 
angle 6 '. From the principle of conservation of momentum it then follows 
that ( hv/c) (sin 0 — sin 6 ') =nh /a , which is precisely Bragg’s famous 
formula a (sin 0 — sin 0 ') = nA. Duane could easily generalize his result 
for the reflection of x-rays from infinite three-dimensional crystals. “In 
the above discussion of the reflection of x-rays by crystals,” he declared, 
“nothing has been specifically said about wave-lengths or frequencies of 
vibration, although in order to conform with usage the energy of the inci¬ 
dent radiation quantum was put equal to hv , and in order to compare the 
formulas obtained with the usual expressions of them, the wavelength A 
was introduced. We might just as well have called the energy e and intro¬ 
duced it into the equation.” 40 

As Duane’s work and Compton’s 41 elaborations of it showed, Fraun¬ 
hofer diffraction phenomena could be accounted for in terms of the quan¬ 
tum-corpuscular theory of light. All attempts, however, to obtain similar 
results for finite diffracting systems or Fresnel diffraction phenomena 
proved unsuccessful. Epstein and Ehrenfest, 42 who made an extensive 
study of this problem, failed to solve it, mainly because they could not 
endow light quanta with characteristics corresponding to the undulatory 
properties of phase relations and coherence. 

The conflict between undulatory and corpuscular conceptions of light 
was further accentuated by the difficulty of applying quantum concep¬ 
tions to optical dispersion. For according to the perturbation theory, if 
applied to Bohr’s model of the atom, resonance with incident radiation 
should occur at the frequencies 43 2 r k v k of the Fourier components of 
motion. But this conclusion was incompatible with experience, such as, 
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for example, Wood’s 44 famous experiments on sodium vapor and Be- 
van’s 45 work on potassium vapor. 

Thus, a decision between the two competing theories on the nature of 
radiation could not be enforced. For the interpretation of optical pro¬ 
cesses involving the interaction between light and matter, the quantum- 
corpuscular view seemed indispensable whereas phenomena like interfer¬ 
ence and diffraction seemed to require the conceptual apparatus of the 
classical wave theory of light. This state of affairs was well characterized 
by Sir William Bragg when he said that physicists are using on Mondays, 
Wednesdays, and Fridays the classical theory, and on Tuesdays, Thurs¬ 
days, and Saturdays the quantum theory of radiation. The use of two such 
mutually exclusive, and therefore contradictory, notions was rightfully 
regarded as a serious dilemma at the very foundations of physical science. 
In fact, the situation was even worse! For it seemed unavoidable to define 
the corpuscular quantum of radiation in terms of frequency, frequency in 
terms of wavelength, and wavelength could be measured only by applying 
undulatory notions such as interference or diffraction. In other words, a 
hypothesis, supported by incontestable experimental evidence, became 
physically significant only by the use of its own negation. 


4.2 The Philosophical Background of Nonclassical 

Interpretations 

The older quantum theory was regarded either as “a rational generaliza¬ 
tion” of classical mechanics, as Bohr repeatedly called it, or as “a restric¬ 
tion” of classical mechanics, as Sommerfeld and Ehrenfest in view of the 
restrictive quantum conditions preferred. In any case, its metaphysical 
and epistemological foundations were generally held to be those of classi¬ 
cal physics. Thus, with the possible exception of Bohr’s insistence on 
methodological considerations, no specific philosophical study of its con¬ 
ceptual foundations seemed to be required. In fact, before the mid-twen¬ 
ties, philosophers of nature or of science regarded quantum theory as of no 
particular interest from their point of view; it was for them merely a 
particular branch of physics, just like, say, acoustics or electrochemistry. 
Nor did physicists recognize any need for philosophical analysis. As long 
as optical phenomena were relegated exclusively to the physics of waves or 
fields, and mechanical phenomena to the physics of particles or point 
masses, the classical dichotomy of particles and waves with its intrinsic 
problems was simply carried over from the Newtonian-Maxwellian pic¬ 
ture of reality into that of quantum theory. 
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As soon, however, as it became clear that the interpretation of optical 
phenomena required, as we have seen, the use of mutually exclusive no¬ 
tions, it was recognized that a critical examination and epistemological 
analysis of the foundations of the theory was a necessity. For if, as Cassirer 
once remarked, “each answer which physics imparts concerning the char¬ 
acter and the peculiar nature of its fundamental concepts assumes for 
epistemology the form of a question,” how much more so if physics itself 
faces inconsistencies in its foundations. 

Physicists traditionally refrain from declaring themselves as subscrib¬ 
ing to a particular school of philosophical thought, even if they are con¬ 
scious of belonging to it. The influence of a particular philosophical cli¬ 
mate on their scientific work, although often of decisive importance for 
the formation of new conceptions, is generally ignored. It is certainly true 
that philosophical considerations in their effect upon the physicist’s mind 
act more like an undercurrent beneath the surface than like a patent well- 
defined guiding line. It is the nature of science to obliterate its philosophi¬ 
cal preconceptions, but it is the duty of the historian and philosopher of 
science to recover them under the superstructure of the scientific edifice. 
For this task biographies, correspondence, and autobiographical remarks 
supply more information in general than the scientific publications them¬ 
selves. 

Such documentary material, if studied with scrutiny, will show that 
certain philosophical ideas of the late nineteenth century not only pre¬ 
pared the intellectual climate for, but contributed decisively to, the forma¬ 
tion of the new conceptions of the modem quantum theory. In particular, 
the writings of Renouvier, Boutroux, Kierkegaard, and Hdffding seem to 
have been influential in this respect. Following Cournot, 46 for whom natu¬ 
ral laws as construed by science are only approximations since physical 
reality itself is subject to contingency 47 —an event being contingent if its 
opposite involves no contradiction—Renouvier was one of the earliest 
modem thinkers who questioned the strict validity of the causality prin¬ 
ciple as a regulative determinant of physical processes. Renouvier’s point 
of departure was his rejection of actual infinity, which was for him “a 
logical self-contradiction and an empirical falsehood.” 48 With it he also 
rejected continuity since continuity presupposes an actual infinity of gra¬ 
dations. Furthermore, since the establishment of a causal relation be¬ 
tween two distinct events, especially if separated in space and time, rests 
on the possibility of conceiving these two events as connected with each 
other by a continuous chain of intermediate events, Renouvier directed 
his polemics against the acceptance of causality both as an idealistic cate¬ 
gory of the understanding in the Kantian sense, that is, as a principle 
without which experience of an intelligible world would be impossible, 
and as a realistic principle of order in the cosmos. To account for the 
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incontestably cognitive function of science, in spite of this rejection of 
causality, Renouvier proposed a phenomenalism according to which all 
that we immediately know is but a particular phenomenon or “representa¬ 
tion.” Every representation has a two-fold character: it is a “representing” 
and a “represented” (“representatif” and “represente”); it is an experi¬ 
ence of something and it is something experienced. Realism, Renouvier 
contends, is erroneous in so far as it assumes that the object can be di¬ 
vorced from its representation, and idealism is erroneous in so far as it 
assumes that there exist representations with nothing to represent. To 
avoid these errors, experience, thus doubly qualified, should be taken as 
self-sufficient and as constituting ultimate reality itself. 49 

Similar ideas and, in particular, a philosophy of nature based on con¬ 
tingency were advanced by Boutroux, who declared even in his thesis: 
“Analysing the notion of natural law, as seen in the sciences themselves, I 
found that this law is not a first principle but rather a result; that life, 
feeling, and liberty are true and profound realities, whereas the relatively 
invariable and general forms apprehended by science are but the inade¬ 
quate manifestations of these realities.” 50 “All experimental finding is 
reduced, in the end, to confining within as close limits as possible the value 
of the measurable elements of phenomena. We never reach the exact 
points at which the phenomenon really begins and ends. Moreover, we 
cannot affirm that such points exist, except, perhaps, in indivisible in¬ 
stants; a hypothesis which, in all probability, is contrary to the nature of 
time itself. Thus we see, as it were, only the containers of things, not the 
things themselves. We do not know if things occupy, in their containers, 
an assignable place. Supposing that phenomena were indeterminate, 
though only in a certain measure insuperably transcending the range of 
our rough methods of reckoning, appearances would none the less be 
exactly as we see them. Thus, we attribute to things a purely hypothetical 
if not unintelligible determination when we interpret literally the principle 
by which any particular phenomenon is connected with any other particu¬ 
lar phenomenon.” 51 

The rejection of classical determinism at the atomic level played an 
important role also in the philosophical system of Charles Sanders Peirce, 
who contended, as early as 1868, that “nature is not regular” and added 
immediately: “No disorder would be less orderly than the existing ar¬ 
rangement.” 52 His theory of tychism (rv^V = chance) according to 
which “chance is a factor in the universe,” 53 formulated in the late eighties 
and publicized in the early nineties, 54 was for Peirce a logical conclusion 
from the fact that deterministic mechanics with its necessarily reversible 
laws cannot account for the undeniable existence of growth and evolution 
in nature. “The law of the conservation of energy,” he wrote, 55 “is equiva- 
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lent to the proposition that all operations governed by mechanical laws 
are reversible; so that an immediate corollary from it is that growth is not 
explicable by these laws, even if they be not violated in the process of 
growth.” The deterministic or, as he often called it, necessitarian philoso¬ 
phy is therefore alone unable to explain reality. 

Another incontestable argument for the doctrine of tychism was in his 
opinion the incapability of the necessitarians to prove their position em¬ 
pirically by the observation of nature. The essence of their position, he 
maintained, was “that certain continuous quantities have certain exact 
values.” But how can observation determine the value of such a quantity 
“with a probable error absolutely nil 7” And now—almost anticipating 
Heisenberg’s uncertainty principle—he analyzed the process of experi¬ 
mental observation and concluded that absolute chance, and not an inde¬ 
terminacy arising merely from our ignorance, is an irreducible factor in 
physical processes and hence an ultimate category for the existential uni¬ 
verse. “Try to verify any law of nature, and you will find that the more 
precise your observations, the more certain they will be to show irregular 
departures from the law. We are accustomed to ascribe these, and I do not 
say wrongly, to errors of observation; yet we cannot usually account for 
such errors in any antecedently probable way. Trace their causes back far 
enough and you will be forced to admit they are always due to arbitrary 
determination, or chance.” 56 

For the time being ideas of this kind were generally regarded as merely 
philosophical speculations and scientists did not take them too seriously. 
A notable exception before the advent of modem quantum mechanics was 
the Viennese physicist Exner, who proposed in 1919 a statistical interpre¬ 
tation of the apparent deterministic behavior of macroscopic bodies as 
resulting from a great number of probabilistic processes at the submicros- 
copic level. In the ninety-third of his Lectures on the Physical Foundations 
of Science Exner 57 declared: “From a multitude of events... laws can be 
inferred which are valid for the average state (‘Durchschnittszustand’) of 
this multitude (macrocosm) whereas the individual event (microcosm) 
may remain undetermined. In this sense the principle of causality holds 
for all macroscopic occurrences without being necessarily valid for the 
microcosm. It also follows that the laws of the macrocosm are not abso¬ 
lute laws but rather laws of probability; whether they hold always and 
everywhere remains to be questioned....” 58 He continued: “... every single, 
however specialized, physical measurement produces only an average val¬ 
ue resulting from billions of individual motions... but to predict in physics 
the outcome of an individual process is impossible.” 59 

It should be clear, of course, that such probabilistic conceptions as 
advanced by Boutroux and Exner differ fundamentally from the tradi- 
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tional notions of probability as used, for example, in classical statistical 
mechanics. In classical physics probability statements were but an expres¬ 
sion of human ignorance of the exact details of the individual event, either 
because of the insufficient resolving power of our measuring instruments 
or because of the huge number of events involved; the individual physical 
process, however, was always regarded as strictly obeying the law of cause 
and effect, and its result was always considered as uniquely determined. 
The new conception of probability, on the other hand, assumed not only 
that macroscopic determinism is a statistical effect but also that the indi¬ 
vidual microscopic or submicroscopic event is purely contingent. 

Although only a small number of scientists subscribed to these new 
conceptions, their effect on physical thought should not be underestimat¬ 
ed. Even such a staunch proponent of classical thought as Henri Poincare 
was not completely immune against their impact. For when Poincare, 
perhaps the last of the great theoreticians in classical physics, returned 
from the 1911 Solvay Congress in Brussels, he wrote: “It is well known to 
what hypothesis Planck was led by his researches on the laws of radiation. 
According to these the energy of radiators of light varies in a discontin¬ 
uous manner. It is this hypothesis which is called the theory of quanta. It is 
hardly necessary to remark how these ideas differ from traditional con¬ 
ceptions; physical phenomena would cease to obey laws expressible by 
differential equations and this, undoubtedly, would be the greatest and 
most radical revolution in natural philosophy since the time of New¬ 
ton.” 60 If this statement was obviously provoked in response to the discus¬ 
sions at the Congress, a passage in Poincare’s Valeurde la Science , which 
had already attracted the attention of de Broglie, 61 shows that probabilis¬ 
tic considerations engaged the French mathematician at least as early as 
1904. For he wrote: “Facts which appear simple to us will be only the 
result of a very large number of elementary facts which the law of chance 
alone will lead to a single goal. In that case a physical law will take on a 
wholly new aspect; it would no longer be a differential equation; it would 
take on the character of a statistical law.” 62 Similarly, in his Dernieres 
Pensees , six months before his death, Poincare declared: “We now wonder 
not only whether the differential equations of dynamics must be modified, 
but whether the laws of motion can still be expressed by means of differen¬ 
tial equations.... It is being asked whether it is not necessary to introduce 
discontinuities into the natural laws, not apparent ones but essential 
ones.” 63 It is clear that Poincare’s question whether differential equations 
are still the appropriate instrument for the mathematical formulation of 
physical laws was but a mathematician’s way of expressing his doubts in 
the validity of the causality principle. For, obviously, the positing of a 
differential equation presupposes a continuous change or a continuous 
chain of events as implied in the conception of causality. 
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There seems to be no doubt that Renouvier’s and Boutroux’s works 
had been read by the young Poincare just as there is no doubt that Poin¬ 
care’s writings on the philosophy of science had been read by the origina¬ 
tors of modem quantum physics. 64 The influence of Poincare’s ideas on 
the younger generation of mathematical physicists is well exemplified by 
Charles Galton Darwin, whose father, Sir George Darwin, already re¬ 
ferred to the French mathematician as “the presiding genius—or, shall I 
say, my patron saint?” Charles Galton Darwin was so much impressed by 
Poincare’s article “Sur la theorie des quanta” 65 that in August, 1914, he 
translated it into English. 66 Whenever facing a problem of general impor¬ 
tance, Darwin asked himself how Poincare would have thought about it. 
In a letter to Bohr, for example, in which he stressed the need for sound 
analytical conclusions and inductive reasoning, Darwin wrote in 1919: “I 
wish Poincare were alive, as he could do that kind of argument like no one 
else.” 67 

C. G. Darwin was one of the first to search for a logically consistent 
formulation of the theoretical foundations of quantum theory. In an un¬ 
published paper entitled “Critique of the foundations of physics” 68 he 
wrote in 1919: “I have long felt that the fundamental basis of physics is in a 
desperate state. The great positive successes of the quantum theory have 
accentuated all along, not merely its value, but also the essential contra¬ 
dictions over which it rests.... It may be that it will prove necessary to 
make fundamental changes in our ideas of time and space, or to abandon 
the conservation of matter and electricity, or even in the last resort to 
endow electrons with free will.” Because of the war only now was the 
physical significance of Millikan’s 69 brilliant experimental confirmation 
of the quantum-corpuscular interpretation of the photoelectric effect fully 
recognized. Since an explanation in terms of the classical wave theory 
conflicted with the principle of energy conservation, 70 it was only natural 
that Darwin studied at first the possibilities “to be deduced from denying 
this principle,” arguing “that the most promising outlook for the recon¬ 
ciliation of our difficulties is to suppose that energy is not exactly con¬ 
served.” And Darwin continued: “Now, no one can doubt that energy is 
approximately conserved, but ordinary dynamical and electrical experi¬ 
ments only establish it statistically, and so really only put it on the same 
footing as the second law of thermodynamics.” In paragraph 5 of the 
unpublished paper Darwin discussed the consequences of denying the 
conservation of energy but reached no satisfactory results. 

In the following years Darwin became particularly interested in the 
problem of how to reconcile the phenomena of optical dispersion with the 
theory of quanta. 71 It was in this connection that, probably for the first 
time in the history of physics, a solution was proposed which was based on 
extending the existing conceptual apparatus of theoretical physics. “It 
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must be taken as absolutely certain that both the electromagnetic theory 
and the quantum theory are valid in their respective fields, and equally 
certain that the two descriptions are incompatible. We can only conclude 
that they are parts of an overriding system, which would give rise to 
mathematical formulae identical with those of the present theory.” 72 It is 
hard to say whether this statement suggested a synthesis of the two dispa¬ 
rate conceptions, particle and wave, at a higher level, as the use of the term 
“overriding” seems to imply, 73 or whether it referred merely to the estab¬ 
lishment of an abstract mathematical formalism which comprises quan¬ 
tum theory as well as wave theory, as his emphasis on “mathematical 
formulae” seems to indicate. But, whatever its meaning, Darwin’s sugges¬ 
tion of extending the existing conceptual scheme was not heeded at the 
time and had to wait for its realization until further experimental and 
theoretical research prepared the way for the conception of an entity 
which manifests itself as a wave or as a particle depending upon the experi¬ 
mental conditions under which it is observed. The development of this 
conception of physical reality, which is intimately connected with the 
name of Niels Bohr and the so-called “Copenhagen interpretation” of 
quantum physics, is the culmination of certain epistemological ideas 
whose historical origin and philosophical background will now be dis¬ 
cussed. 

There can be no doubt that the Danish precursor of modem existential¬ 
ism and neoorthodox theology, Stfren Kierkegaard, through his influence 
on Bohr, affected also the course of modem physics to some extent. This 
can be gathered not only from certain references and allusions in Bohr’s 
philosophically oriented writings but also from the very fact that Harald 
Htfffding, 74 an ardent student and brilliant expounder of Kierkegaard’s 
teachings, was the principal authority on philosophical matters for the 
young Bohr. We know that Bohr attended Htfffding’s lectures at the Uni¬ 
versity of Copenhagen and read Hiding’s writings 75 in which Kierke¬ 
gaard’s philosophy occupied a prominent place. Furthermore, Htfffding 
and Niels’s father,. Christian Bohr, a professor of physiology at the Uni¬ 
versity of Copenhagen, were not only colleagues but also intimate friends 
who had much in common; Christian Bohr’s anti-Haeckelism and Harald 
Hiding’s anti-Hegelianism placed both of them in opposition to contem¬ 
porary thought. 

Kierkegaard’s philosophy of life and religion, his so-called “qualitative 
dialectic,” his antithesis between thought and reality, his contradictory 
conceptions of life, and his insistence on the necessity of choice had appar¬ 
ently left a deep impression on Bohr’s youthful mind. In particular Kier¬ 
kegaard’s emphasis on the practical value of thought, his opposition to the 
construction of systems, and his insistence that thought could never attain 
reality, for as soon as it thought to have done so it falsified reality by 
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having changed it into imagined reality—all these ideas contributed to the 
creation of a philosophical climate which facilitated the surrender of clas¬ 
sical conceptions. 

Of particular importance for Niels Bohr was Kierkegaard’s idea, re¬ 
peatedly elaborated by HtffFding, that the traditional speculative philoso¬ 
phy, in its claim to being capable of explaining everything, forgot that the 
originator of the system, however unimportant he may be, forms part of 
the being which is to be explained. “A system can be conceived only if one 
could look back on completed existence—but this would presuppose that 
one no longer exists.” 76 Man cannot without falsification conceive of him¬ 
self as an impartial spectator or impersonal observer; he always necessar¬ 
ily remains a participant. Thus man’s delimitation between the objective 
and the subjective is always an arbitrary act and man’s life a series of 
decisions. Science is a determinate activity and truth a human product, 
not only because it is man who created knowledge, but because the very 
object of knowledge is far from being a thing ready-made from all eternity. 

One cannot fail to note that Hiding’s discussion of the problem of 
knowledge seems to foreshadow certain conceptual traits in later quan¬ 
tum mechanics, as, for example, when he declares: “In an earlier connec¬ 
tion I made use of Schiller’s words: ‘Wide is the brain and narrow is the 
world’; and now the sentence can be reversed: ‘Wide is the world and 
narrow is the brain.’ Knowledge, however rich and powerful it may be, is 
after all only a part of Being; and the problem of knowledge would be 
soluble, only if Being as a totality (in so far as we can conceive it as such a 
totality) could be expressed by means of a single one of its parts. In any 
event, our expression must always remain symbolic, even if our knowl¬ 
edge reaches its climax; it gives us only an extract from a more inclusive 
whole. Among all the possibilities of thought, only a single one appears in 
the reality as recognized by us. The reality which we recognize is, how¬ 
ever, only a part of a greater whole;—and here we are not in a position to 
determine the relation between the parts and the whole. An exhaustive 
concept of reality is not given us to create.” 77 

In life only sudden decisions, leaps, or jerks can lead to progress. 
“Something decisive occurs always only by a jerk, by a sudden turn which 
neither can be predicted from its antecedents nor is determined by 
these.” 78 Referring to Kierkegaard’s indeterministic theory of “leaps,” 
Htfffding called the Danish philosopher “the only indeterministic thinker 
who attempted to describe the leap” 79 but added later: “It seems to be 
clear that if the leap occurs between two states or two moments, no eye can 
possibly observe it, and since it therefore can never be a phenomenon, its 
description ceases to be a description.” 80 Consequently, also “causality 
cannot be described.” 81 
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The preceding remarks and quotations do not, of course, intend to 
present a coherent and comprehensive exposition of the ideas of Kierke¬ 
gaard or IMfFding; their purpose is only to sketch the nature of some of 
these philosophical conceptions which later have been transferred into the 
realm of natural philosophy and have stimulated a new approach for the 
epistemological foundations of quantum theory. 

That conceptions of this kind did indeed affect the new epistemological 
view of scientific inquiry was admitted by Bohr himself when he de¬ 
scribed 82 how he was influenced by Poul Martin Miller’s Adventures of a 
Danish Student , 83 a novel “still read with delight by the older as well as the 
younger generation in this country.” In this novel Bohr found a “vivid and 
suggestive account of the interplay between the various aspects of our 
position, illuminated by discussions within a circle of students with differ¬ 
ent characters and divergent attitudes of life.” 84 Miller described how a 
licentiate of a critical, cautious, and contemplative character and attitude 
of mind is reproached by his practical-minded cousin for not having taken 
advantage of the opportunities of finding a practical job; in reply to these 
admonitions the author made the licentiate declare: “My endless inquiries 
made it impossible for me to achieve anything. Moreover, I get to think 
about my own thoughts of the situation in which I find myself. I even 
think that I think of it, and divide myself into an infinite retrogressive 
sequence of T’s who consider each other. I do not know at which T to stop 
as the actual, and as soon as I stop at one, there is indeed again an T which 
stops at it. I become confused and feel giddy as if I were looking down into 
a bottomless abyss, and my ponderings result finally in a terrible head¬ 
ache.” Whereupon the cousin replied: “I cannot help you a bit in sorting 
your numerous T’s. This is beyond my sphere of action, and I would either 
be or become as mad as you if I let myself take part in your superhuman 
reveries. My line is to stick to tangible things and walk along the broad 
highway of common sense; thus my egos never get confused.” 

It can hardly be doubted that such an introspective analysis, derisive of 
some current trends in European philosophy and thus akin to Kierke¬ 
gaard’s critique, had an impact on Bohr’s “rediscovery of the dialectical 
process of cognition which had so long been obscured by the unilateral 
development of epistemology on the basis of Aristotelian logic and Pla¬ 
tonic idealism,” as L. Rosenfeld 85 once expressed it. Whether Rosenfeld is 
right with his remark that “Poul Miller had little imagined that his light- 
handed banter would one day start a train of thought leading to the eluci¬ 
dation of the most fundamental aspects of atomic theory and the renova¬ 
tion of the philosophy of science” 86 or whether it was originally 
Kierkegaard’s and IMffding’s insistence on the arbitrary but necessary act 
of drawing a boundary line between subject and object in the process of 
scientific inquiry, these ideas, as we shall see later on, were of importance 
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for the future development of modem quantum theory. In spite of the 
lucid and logical presentation of scientific argumentations in his writings, 
Bohr was never fond of system constructing or axiomatization. Like 
Htfffding, who “gave more consideration to the proposition of problems 
than to their solution,” Bohr declared repeatedly: “Every sentence I say 
must be understood not as an affirmation, but as a question.” 87 Kierke¬ 
gaard’s and Hiding’s stress on the practical, pragmatic significance of 
truth 88 reverberated in Bohr’s frequent remark: “It is not the question at 
present whether this view is true or not, but what arguments we can hon¬ 
estly draw with respect to it from the available information.” 89 Kierke¬ 
gaard’s and Hiding’s insistence on the inevitable participation of the 
observer was unrestrictedly accepted by Bohr, for whom “man as an ana¬ 
lyzing subject always occupies the central position.” 90 Just as in psycholo¬ 
gy, Bohr pointed out, “it is clearly impossible to distinguish sharply be¬ 
tween the phenomena themselves and their conscious perception,” 91 so 
also in physical research the traditional distinction between the observer 
and the object observed has to be abandoned in atomic physics. With it 
also the validity of the classical description of atomic phenomena in terms 
of space and time becomes questionable. 

In an interesting exchange of letters between IMfFding and Niels 
Bohr, 92 following HdfFding’s presentation of the galleys of a new edition of 
his Formel Logik 93 to Bohr, in which a discussion of the principle of the 
excluded third played an important role, Bohr wrote 94 to Htfffding in 
1922: “On the other hand we encounter difficulties which lie so deep that 
we do not have any idea of the way of their solution; it is my personal 
opinion that these difficulties are of such a nature that they hardly allow 
us to hope that we shall be able, inside the world of the atoms, to carry 
through a description in space and time of the kind that corresponds to 
our ordinary images.” 95 

As we shall see, primarily in our discussion on the Copenhagen inter¬ 
pretation of quantum mechanics, Bohr was strongly influenced also by 
William James. Bohr repeatedly admitted how impressed he was particu¬ 
larly by the psychological writings of this American philosopher. He be¬ 
came acquainted with the teachings of James probably through Hdffding, 
who had visited James in Cambridge (Massachusetts) when attending the 
St. Louis International Congress of Arts and Science in the fall of 1904. 96 
James, it will be remembered, brought C. S. Peirce’s pragmatic philoso¬ 
phy in metaphysics, ethics, religion, and the theory of knowledge to its 
logical conclusion. James’s well-known maxim “We must find a theory 
that will work” still has its place in modem physics. 

It is interesting to note, from our point of view, that Renouvier’s con- 
tingentism and phenomenalism with its implications on the problem of 
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free will was a turning point in James’s intellectual and personal history— 
in fact, to such an extent that, in 1870, Renouvier’s philosophy relieved 
him from his phobic panic and from his “monistic superstition.” 97 James 
dedicated his posthumously published book Some Problems of Philoso¬ 
phy 98 to Renouvier, calling him “one of the greatest of philosophic char¬ 
acters.” 99 

An interesting discussion on discontinuity and indeterminism and on 
the relation of these notions to the principle of energy conservation is 
found in the correspondence between Renouvier and James. 100 “Plural¬ 
ism and indeterminism,” wrote James, 101 “seem to be but two ways of 
stating the same thing.” Renouvier fully agreed. 102 In a note enclosed in a 
letter to James 103 Renouvier explained how in his view the concept of 
discontinuous motion contradicts the idea of energy conservation; for the 
latter, he contended, is a consequence of the laws of motion; but these do 
not admit of discontinuous solutions. 104 These remarks were written in 
opposition to Joseph Delbceuf s provocative essay on “Determinism and 
Freedom—Freedom demonstrated by Mechanics,” 105 in which the Bel¬ 
gian mathematician attempted to show, on the basis of Newtonian me¬ 
chanics, not only that discontinuous motions do exist, but also that their 
existence can be reconciled with the principles of energy conservation and 
increase of entropy. 106 

As we shall see later on, James’s work in psychology, which gave rise to 
a great variety of new developments in this field, and especially his monu¬ 
mental Principles of Psychology, 101 for many years one of the most popular 
textbooks on this subject, was destined, interestingly enough, to play an 
important role also in the development of quantum mechanics. Written in 
a rich and vivid literary style and containing a wealth of new ideas and 
suggestions, the Principles , and in particular its ninth chapter, “The 
Stream of Thought,” was one of the major factors which influenced, wit¬ 
tingly or unwittingly, Bohr’s formation of new conceptions in physics. It is 
not impossible that Bohr’s idea of stationary states, as conceived in 1913, 
had already been influenced by the following passage from “The Stream of 
Thought”: “Like a bird’s life, [the stream of our consciousness] seems to 
be made of an alternation of flights and perchings. The rhythm of lan¬ 
guage expresses this, where every thought is expressed in a sentence, and 
every sentence closed by a period. The resting-places are usually occupied 
by sensorial imaginations of some sort, whose peculiarity is that they can 
be held before the mind for an indefinite time, and contemplated without 
changing; the places of flight are filled with thoughts of relations, static or 
dynamic, that for the most part obtain between the matters contemplated 
in the periods of comparative rest. Let us call the resting-places the ‘ sub¬ 
stantive parts' and the places offlight the ‘transitive parts,' of the stream of 
thought. It then appears that the main end of our thinking is at all times the 
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attainment of some other substantive part than the one from which we 
have just been dislodged. And we may say that the main use of the transi¬ 
tive parts is to lead us from one substantive conclusion to another. Now it 
is very difficult, introspectively, to see the transitive parts for what they 
really are. If they are but flights to a conclusion, stopping them to look at 
them before the conclusion is reached is really annihilating them. Whilst if 
we wait till the conclusion be reached, it so exceeds them in vigor and 
stability that it quite eclipses and swallows them up in its glare. Let anyone 
try to cut a thought across in the middle and get a look at its section, and 
he will see how difficult the introspective observation of the transitive 
tracts is. The rush of the thought is so headlong that it almost always 
brings us up at the conclusion before we can arrest it. Or if our purpose is 
nimble enough and we do arrest it, it ceases forthwith to be itself. As a 
snow-flake crystal caught in the warm hand is no longer a crystal but a 
drop, so, instead of catching the feeling of relation moving to its term, we 
find we have caught some substantive thing, usually the last word we were 
pronouncing, statically taken, and with its function, tendency, and partic¬ 
ular meaning in the sentence quite evaporated. The attempt at introspec¬ 
tive analysis in these cases is in fact like seizing a spinning top to catch its 
motion, or trying to turn up the gas quickly enough to see how the dark¬ 
ness looks.” 108 

This vivid description of the impossibility of observing introspectively 
the “transitive parts” in the stream of thought and feelings left a strong 
impression on Bohr’s mind, as he admitted himself. Numerous allusions 
to the difficulties of introspective observation and frequent comparisons 
between physics and psychology in Bohr’s later writings seem to indicate 
that some of his ideas—for instance his conceptions concerning the arbi¬ 
trary boundary line between the observer and the object of observation, 
the incontrollable interaction between these two and its epistemological 
implications—have been inspired by James’s philosophy. 

To complete our survey on the philosophical background of modem 
quantum physics, the rise of the logical positivism in the twentieth century 
also has to be considered, being one of the factors without which the 
development of the new conceptual situation in quantum physics can 
hardly be understood. Although the Vienna Circle (“Wiener Kreis”) 
eptered publicity as an independent school of philosophy only at the end 
of the twenties, 109 the impact of logical empiricism on theoretical physics 
can be recognized much earlier. 

Ever since a chair for the philosophy of the inductive sciences was 
established in 1895 at the University of Vienna, theoretical physics has 
been subjected to critical examinations by empiricistic philosophy. After 
Ernst Mach it was primarily Moritz Schlick—Schlick’s doctor’s thesis, 
under the guidance of Max Planck, was on optical reflection in nonhomo- 
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geneous media—who exerted great influence in this respect, especially 
after the publication of his Space and Time in Contemporary Physics. 110 
This monograph stimulated great interest in the epistemological founda¬ 
tions of physics and, although it dealt primarily with the foundations of 
relativity, its influence on theoretical physics in general should not be 
ignored. Schlick entertained personal contact with such leading physicists 
and mathematicians as Einstein, Planck, and Hilbert, and his conceptions 
concerning epistemological questions in physics, in particular, his insis¬ 
tence that a statement is meaningful only if it is empirically verifiable, 
became influential among physicists in the early twenties. Since 1921 
Wittgenstein’s Tractatus Logico-Philosophicus 111 had evoked much dis¬ 
cussion not only among professional philosophers alone. Its famous con¬ 
cluding statement “Whereof one cannot speak, thereof one must be silent” 
was soon to find, as we shall see in due course, an unexpected application 
in Heisenberg’s new approach to the study of atomic phenomena. 

Although only the later development of quantum mechanics gave rise 
to the present unprecedented interest in the philosophy of science, philos¬ 
ophy’s original contributions to the development of the physical theory 
should not be ignored. It made physics disposed toward a logical clarifica¬ 
tion and methodological purification and forced physicists “to come to 
grips with philosophical problems to a greater degree than was ever the 
case in earlier generations.” 112 

The philosophical schools we have mentioned, contingentism, existen¬ 
tialism, pragmatism, and logical empiricism, rose in reaction to tradition¬ 
al rationalism and conventional metaphysics whose ultimate roots were in 
the philosophy of Descartes. Their affirmation of a concrete conception of 
life and their rejection of an abstract intellectualism culminated in their 
doctrine of free will, their denial of mechanical determinism or of meta¬ 
physical causality. United in rejecting causality, though on different 
grounds, these currents of thought prepared, so to speak, the philosophi¬ 
cal background for modem quantum mechanics. They contributed with 
suggestions to the formative stage of the new conceptual scheme and sub¬ 
sequently promoted its acceptance. 

In concluding this section we wish to point out that much has been 
published on the influence of physics and, in particular of quantum me¬ 
chanics on current philosophical thought, but the converse problem the 
study of how far philosophy contributed to current physical thought, has 
rarely been dealt with. But physical thought, even at its great turning 
points when treading unwonted paths in its formation of new conceptions, 
never works in a vacuum. Even “discontinuities” in its intellectual devel¬ 
opment are somehow related to general contemporary thought. We are 
fully aware that the present treatment of this problem is far from complete 
and, to some extent, even conjectural in its assertions. It is hoped that 
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further research will throw additional light on these still obscure relations 
between recent philosophy and modem physics. 


4.3 Nonclassical Interpretations of Optical Dispersion 

The transition from the older quantum theory to modem quantum me¬ 
chanics occurred within a surprisingly short period of time. But it was not 
one single conceptual process. It was not even “a straight staircase up¬ 
wards,” but rather “a tangle of interconnected alleys,” as Bom once char¬ 
acterized 113 it so fittingly. Correspondingly, also the surrender of classical 
notions and their replacement by fundamentally new conceptions, whose 
philosophical background was the subject of the preceding section, did 
not precipitate in one single movement, but took place rather piece-meal, 
though at a rapid pace. 

The first step in this development, though only soon to be retraced, was 
in connection with the phenomenon of dispersion, which, as we have seen, 
was one of the most difficult problems to reconcile with quantum concep¬ 
tions. But—perhaps just because of this difficulty—it proved most instru¬ 
mental for the formation of new conceptions. 

The glaring contrast between the discrete features of the quantum theo¬ 
ry of atomic structure, on the one hand, and the continuous aspects of the 
electromagnetic theory of light, on the other, was one of the most chal¬ 
lenging problems of the time. It was early realized that at the root of the 
difficulties to reconcile these two branches of physics was the principle of 
the exact conservation of energy and momentum. For it was hard to un¬ 
derstand how, for example, in a system composed of an electromagnetic 
radiation field, susceptible of only continuous changes of energy, and an 
aggregate of atoms, emitting or absorbing only discrete quanta of energy, 
the sum total of a continuous and of a discrete amount of energy could be a 
constant. Viewed from this angle, Einstein’s conception of discrete light 
quanta, by which he explained the energy balance for the interaction be¬ 
tween matter and radiation, was essentially a redefinition of energy. 

But there was an alternative: one could reject the energy principle as an 
exact law and regard it instead as merely a statistical law. This alternative, 
as we have seen, 114 had already been envisaged in 1919 by Darwin as a 
possibility to harmonize Bohr’s quantum theory of the atom with Max¬ 
well’s theory of the field. In 1922 he applied this idea for the formulation 
of his theory of dispersion. 115 “It is, of course, a fact of observation,” he 
declared, “that, in the gross, energy is conserved, but this means an aver¬ 
aged energy; and as pure dynamics has failed to explain many atomic 
phenomena, there seems no reason to maintain the exact conservation of 
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energy, which is only one of the consequences of the dynamical equa¬ 
tions”—an argumentation strikingly similar to Renouvier’s reasoning in 
his letter to William James. 116 According to Darwin’s theory of dispersion 
an atom, if struck by an incident wave of light, acquires a probability of 
emitting a spherical wave train which interferes with the exciting light, the 
probability being a function of the intensity of the incident light. When 
Bohr, however, in a private letter, directed Darwin’s attention to certain 
inconsistencies and when it became clear that the dispersion in the case of 
very weak radiation cannot be accounted for, Darwin tried to modify his 
theory. 117 It was in the course of these attempts that he made the state¬ 
ment concerning the “overriding system” which we have mentioned 
above. 118 

The statistical element in Darwin’s approach seems to have eventually 
inspired J. C. Slater to new far-reaching ideas. In January, 1924, Slater 
published a note on “Radiation and atoms.” 119 Its declared purpose was 
“to build up a more adequate picture of optical phenomena than has 
previously existed.” Convinced of the correctness of Compton’s results 
concerning the existence of light quanta, Slater attempted “to tie togeth¬ 
er” light quanta and waves by constructing, on the basis of Poynting’s 
vector and the energy density of the electromagnetic field, a quantity 
which transforms like a vector and thus could represent the velocity of the 
light quantum. After a careful study of this problem and, in particular, 
after having read Cunningham’s treatise on Relativity , the Electron Theo¬ 
ry and Gravitation , 120 the second edition of which had just been published, 
Slater saw the futility of this program. He therefore concluded that the 
only possibility to establish such a relation would be to assume a statistical 
connection between light quanta and waves. 

The note itself, however, hardly referred to light quanta but suggested 
that each atom should be supposed to “communicate” constantly with all 
the other atoms as long as it is in one of its stationary states, this communi¬ 
cation being established by a virtual field of radiation which acts as if it 
originates from oscillators whose frequencies are those of the possible 
quantum transitions of the given atom. “The part of the field originating 
from the given atom itself is supposed to induce a probability that that 
atom lose energy spontaneously, while radiation from external sources is 
regarded as inducing additional probabilities that it gain or lose energy 
much as Einstein has suggested. The discontinuous transition finally re¬ 
sulting from these probabilities has no other external significance than 
simply to mark the transfer to a new stationary state, and the change from 
the continuous radiation appropriate to the old state to that of the new.” 

Thus, by assuming a virtual radiation field through which distant 
atoms are capable of communicating with each other even before transi- 
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tions between stationary states take place, Slater attempted to reconcile 
the discrete theory of light quanta with the continuous wave theory of the 
electromagnetic field. However, when Slater, visiting Bohr’s Institute at 
that time, expounded the details of his theory in Copenhagen, Kramers 
pointed out that instead of leading to a strong coupling between emission 
and absorption processes among distant atoms, Slater obtained, contrary 
to his intentions, a greater independence of these processes than previous¬ 
ly conceived. Furthermore, Kramers, like Bohr, still regarded Einstein’s 
theory of light quanta, in spite of its acknowledged heuristic value as 
shown by its confirmation through the photoelectric effect, as an unsatis¬ 
factory solution of the problem of light propagation, if only because “the 
radiation ‘frequency’ v appearing in the theory is defined by experiments 
on interference phenomena which apparently demand for their interpre¬ 
tation a wave constitution of light.” Although certain details of Slater’s 
hypothesis had therefore to be rejected, a theory of virtual oscillators and 
of an associated virtual radiation field was worked out so as to make it 
possible to harmonize the physical picture of the continuous electromag¬ 
netic field with the physical picture, not as Slater proposed of light quanta, 
but of the discontinuous quantum transitions in the atom. For, having 
renounced the possibility of any time-spatial description of transitions 
between stationary states, these transitions, “jumps” or “leaps,” were for 
Bohr and Kramers discontinuous processes whereas at the same time the 
propagation of light, as just explained, was to be regarded as a continuous 
phenomenon. A statistical connection, as Slater proposed, was the only 
conceivable possibility of correlating continuity with discreteness. More¬ 
over, such a correlation had also to renounce any causal connection for, as 
explained in the preceding section, causality presupposes continuity. Fin¬ 
ally, the replacement of causal connectivity by statistical considerations 
made it logically unavoidable to degrade the principles of energy and 
momentum conservation to the status of statistical theorems. 

Now the problem was how to formulate on the basis of such assump¬ 
tions a consistent theory which could account for the observed phenome¬ 
na of dispersion, refraction, and reflection. The program of such a theory 
was published by Bohr, Kramers, and Slater 121 in a paper entitled “The 
quantum theory of radiation.” 122 Although soon disproved, as we shall 
see, on the basis of experimental evidence, this paper proved extremely 
important for the conceptual development of quantum theory for the fol¬ 
lowing reasons: (1) it was the first major paper in physics which deliber¬ 
ately and programmatically renounced methods of explanation as well as 
fundamental principles of classical theory; (2) it evoked, consequently, a 
great amount of discussion among physicists and directed thereby the 
attention of quantum theorists to questions concerning the epistemologi¬ 
cal foundations of atomic physics; 123 and (3) it was the point of departure 
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of Kramers’s detailed theory of dispersion whose further elaboration by 
Kramers and Heisenberg led the latter to the discovery of matrix mechan¬ 
ics, the first formulation of quantum mechanics. 

Bohr and Kramers assumed with Slater that every atom, as soon and as 
long as it occupies a given stationary state, communicates “continually 
with other atoms through a time-spatial mechanism which is virtually 
equivalent with the field of radiation which on the classical theory would 
originate from the virtual harmonic oscillators corresponding with the 
various possible transitions to other stationary states.” In other words, 
with each stationary state of a given atom a virtual radiation field is asso¬ 
ciated composed of as many monochromatic spherical waves as there are 
possible transitions to lower energy levels in the given atom in accordance 
with Bohr’s frequency condition. The radiation field is not causally con¬ 
nected with transitions between stationary states. It is rather assumed that 
“the occurrence of transition processes for the given atom itself, as well as 
for the other atoms with which it is in mutual communication, is connect¬ 
ed with this mechanism by probability laws,” so that there exists for each 
frequency component of the field a temporally invariant probability, pro¬ 
portional to the time element dt , that the transition, corresponding to the 
particular frequency, occurs within dt. “The occurrence of a certain tran¬ 
sition in a given atom will depend on the initial stationary state of this 
atom itself and on the states of the atoms with which it is in communica¬ 
tion through the virtual radiation field, but not on the occurrence of tran¬ 
sition processes in the latter atoms.” 

Bohr, Kramers, and Slater were able to show that in the limit, where 
the correspondence principle holds, the resulting relation between the 
virtual radiation field, on the one hand, and the motions of the particle, on 
the other, coincided with the state of affairs as described by the classical 
radiation theory. For characteristic quantum processes, however, they 
abandoned “any attempt at a causal connection between the transitions in 
distant atoms, and especially a direct application of the principles of con¬ 
servation of energy and momentum.” A transition in an atom A, accord¬ 
ing to their view, is not directly caused by a transition in a distant atom B 
nor is the energy difference between the initial and the final stationary 
states of A equal to that of B. “On the contrary, an atom which has 
contributed to the induction of a certain transition in a distant atom 
through the virtual radiation field conjugated with the virtual harmonic 
oscillator corresponding with one of the possible transitions to the other 
stationary state, may nevertheless itself ultimately perform another of 
these transitions.” For individual atomic processes neither energy 124 nor 
momentum is thus preserved. However, by relating the probability factor 
to the intensity of radiation precisely in such a way that the radiative 
energy of a given frequency, emitted during the average lifetime of the 
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excited state, is equal to the product of the actual energy loss of the atom 
and the probability (relative number of cases of occurrence) of the corre¬ 
sponding transition energy is conserved on the average. 

Applying their theory to the Compton scattering of radiation by free 
electrons, the authors of the paper under discussion had to conclude that 
the “illuminated electron possesses a certain probability of taking up in 
unit time a finite amount of momentum in any given direction.” 125 Thus, 
the statistical interpretation of this process implied that the direction of 
the recoil electron is not uniquely determined as it would have to be ac¬ 
cording to the momentum-conservation law but displays a wide statistical 
distribution. That this conclusion opened a possibility for an experimental 
test of the theory was recognized by Bothe and Geiger 126 in June, 1924. In 
their experiment, which they performed a few months later 127 and which 
was a pioneer work in those days before the development of modem elec¬ 
tronic coincidence techniques, they used two point counters, one for the 
reception of the scattered x-rays and the other for the recording of the 
recoil electrons; they found “that approximately every eleventh light 
quantum coincides with a recoil electron,” a result which, owing to the 
conditions of the experiment, showed that the chances of these coinci¬ 
dences being accidental are of the order 10 -5 . Bothe and Geiger thus 
concluded that their result “is incompatible with Bohr’s interpretation of 
the Compton effect.” 128 Meanwhile, also Compton and Simon, 129 employ¬ 
ing the technique of the cloud-chamber expansion for the determination 
of the direction and time of ejection of the recoil electrons, verified that 
“on the average there is produced one recoil electron for each quantum of 
scattered x-rays. Each recoil electron produces a visible track, and occa¬ 
sionally a secondary track is produced by the scattered x-ray. When but 
one recoil electron appears on the same plate with the track due to the 
scattered rays, it is possible to tell at once whether the angle satisfies” 
Eq. (4.4), page 169, as required by Compton’s theory of light quanta. It 
was therefore possible to obtain an “unequivocal answer to the question 
whether the energy of a scattered x-ray quantum is distributed over a wide 
solid angle or proceeds in a definite direction.” Compton and Simon 
found, just as Bothe and Geiger did, that “the results do not appear to be 
reconcilable with the view of the statistical production of recoil and 
photo-electrons by Bohr, Kramers and Slater. They are, on the other 
hand, in direct support of the view that energy and momentum are con¬ 
served during the interaction between radiation and individual elec¬ 
trons.” 130 

It is noteworthy to recall in this context that about ten years later an 
apparently negative result of the Compton-Simon experiment 131 led to 
speculations over a possible revival of the Bohr-Kramers-Slater statistical 
theory and stimulated renewed interest in carrying out the Compton- 
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Simon or Bothe-Geiger kind of experiments with improved instrumenta¬ 
tion for time resolution. Thus, Hofstadter and McIntyre, 132 using stilbene 
scintillation counters, demonstrated irrefutably “that the recoil electrons 
and the scattered photons are emitted together within a time interval of 
less than 1.5 X 10 -8 sec.” In the same year, Cross and Ramsey 133 showed 
that the Compton electrons are ejected within + 1 deg of the direction 
required by the conservation laws when the scattering direction of the 
associated quanta is specified. 

In this context it should be noted that Bohr, for the first time, probably 
under the impact of Compton’s discovery, had no objection to admitting 
and employing in the Bohr-Kramers-Slater paper the notion of momen¬ 
tum transfer of radiation and to associating it with quantum transi¬ 
tions, 134 and this in spite of his persistent refusal to accept the idea of light 
quanta. Bohr’s insistence on the exclusive validity of wave conceptions is 
well illustrated by his reaction to a letter in which Einstein, shortly after 
the publication of the paper under discussion, slightly criticized Bohr’s 
opposition to the conception of light quanta. Bohr is said to have replied 
that even if Einstein had sent him a telegram informing him that by now 
he (Einstein) had found an irrevocable proof of the physical existence of 
light quanta, even then, Bohr continued, “the telegram could only reach 
me by radio on account of the waves which are there.” 135 

It is hard to find in the history of physics a theory which was so soon 
disproved after its proposal and yet was so important for the future devel¬ 
opment of physical thought as the theory of Bohr, Kramers, and Slater. 
Now it will be understood that not because of its specific physical contents 
was it important, but because of its radically new approach. By interpret¬ 
ing Einstein’s spontaneous emission as a process “induced by the virtual 
field of radiation” and Einstein’s induced transitions as occurring “in 
consequence of the virtual radiation in the surrounding space due to other 
atoms,” 136 it paved the way for the subsequent quantum-mechanical con¬ 
ception of probability as something endowed with physical reality and not 
as merely a mathematical category of reasoning. 137 Also of epistemologi¬ 
cal importance was the very notion of “virtual” harmonic oscillators. For 
it was an early example of “some intermediate kind of reality,” a concep¬ 
tion which for some time seemed to have played a not negligible role in 
some quarters. For, as Heisenberg 138 later phrased it, when it became 
gradually clear that “no cheap solutions” could lead to a comprehensive 
understanding of quantum theory, but that one had to pay a high price, 
the “idea of having such intermediate kinds of reality was just the price 
one had to pay.” Connected with this is also Heisenberg’s contention that 
the idea of an atom as a set of oscillators or, as Lande once called it, “a 
virtual orchestra” 139 “prepared the way for the later idea that the assem¬ 
bly of oscillators is nothing but a matrix.” 140 
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However this may be, the Bohr-Kramers-Slater paper was undoubteu 
ly instrumental for the development of quantum mechanics in view of the 
fact that it made it possible to establish a dispersion theory which offered a 
new approach to the fundamental problem concerning the interaction of 
matter and radiation. For, if the emission or absorption frequencies of the 
atom, in accordance with Bohr’s frequency condition, have to be incorpo¬ 
rated into the dispersion formula, the theory has to come to grips with the 
dispersion process itself. A confrontation with the basic problem of the 
quantum theory, the problem of the interaction of matter and radiation, 
could no longer be evaded—and it was this kind of confrontation which 
eventually led to the birth of the earliest formulation of modem quantum 
mechanics. A discussion of the development of modem dispersion theory, 
leading to the Kramers-Heisenberg formulation, is therefore indispensa¬ 
ble for a profound comprehension of our subject. 

According to the classical 141 electron theory of dispersion the equation 
of motion of a quasi-elastically bound particle of mass m and charge e 
under the influence of an external sinusoidally varying field E = E 0 exp 
(Imvt) is 


mp k + mcj 0k 2 p k - e 2 E 


(4.5) 


where p k = er k is the electric moment 142 of the particle and v ok = co ok /7 tt 
is the proper frequency with which the particle would oscillate in the 
absence of external forces. The solution of the differential Eq. (4.5) is 


Pk = 


e 2 E /m 

4w 2 (v 0 * 2 - v 2 ) 


(4.6) 


Hence, from the well-known equation D = E + 4irP, where P = 2,N k p k is 
the polarization, N k the number, per unit of volume, of charged particles 
with proper frequency v ok or “dispersion electrons” of the kind k 9 and 
from the equation D = eE, where e = n 2 is the dielectric constant, it fol¬ 
lows that the index of refraction n is given by the following dispersion 
formula: 



= 1 + 7^ 


N k e 2 /m 
v 0 * 2 - V 2 


(4.7) 


an equation which agrees essentially with the dispersion formulas derived 
on the basis of the elastic-solid theory by Sellmeier and Helmholtz and on 
the basis of the electromagnetic theory by Maxwell. 143 If each atom con¬ 
tains f k dispersion electrons of the kind k, the polarizability a, defined by 
the equation P = aE, is given by 


a = 


477 2 m 



(4.8) 
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These classical formulas agreed fairly well with experience and offered 
a satisfactory interpretation of dispersion and also of absorption if veloc¬ 
ity-dependent terms, representing damping forces and leading to com¬ 
plex-valued indices of refraction, were introduced in Eq. (4.5). When 
Bohr’s theory of stationary states, however, superseded the classical con¬ 
ception of elastically bound electrons, these formulas, in spite of their de 
facto validity, suddenly lost completely their theoretical justification. It 
became therefore a major project for theoretical research to reconcile 
these equations with the basic assumptions of quantum theory. The early 
attempts at formulating a theory of dispersion in terms of quantum-theor¬ 
etic conceptions, as made by Debye , 144 Sommerfeld , 145 and Davisson , 146 
proved unsatisfactory, primarily because of their application of the classi¬ 
cal perturbation theory to Bohr’s atomic model by which they were led to 
the inevitable conclusion that the Fourier frequencies r x v x + • • • + r f v f 
are also the frequencies of the optical resonance lines contrary to experi¬ 
ence. 

The first step toward the formulation of a quantum-theoretic interpre¬ 
tation of dispersion was made by Ladenburg . 147 According to the classical 
electron theory the energy emitted per second by A oscillators of mass m, 
charge e, and frequency v 0 is J cX = UN /r, where r = 3 mc 3 / 87 rVv 0 2 is the 
time in which the oscillator’s energy is reduced to e ~ 1 of its original value 
(e is the base of the natural logarithms ) 148 and where U is the average 
energy of each oscillator, satisfying Eq. (1.6), page 9, U = 3 c 3 w Vo / 87 rv 0 2 . 
Hence 

J cl = — Nu Vo . (4.9) 

m 


Quantum-theoretically, on the other hand, according to Einstein, the 
energy / abs absorbed per second by transitions from the state / to the state 
k f where E t <E ki is given by / abs = hv ik NiB k l u ik , where N ( is the number 
among the A atoms which are in state /, u ik is the field energy density for 
the frequency v ik9 and B l k = B x are the Einstein probability coefficients 
defined on page 112. Under equilibrium conditions the energy / qu emitted 
per second by the transitions from state k to state i is equal to / abs , so that 


</qu = hv ik N,B ‘ k u ik . (4.10) 

Putting J cX = J qu and recalling Eq. (3.7), page 113, Ladenburg concluded 
that 


N=N { 


me 


817V v* 2 


A 1 


(4.11) 


Since for an atom in state i it has to be assumed 149 that f in Eq. (4.8) 
equals N/N if Eq. (4.11) implies that the atom has a varying electric 
moment of amplitude 
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(4.12) 


where v ik = (E k — E t )/h. 

Kramers 150 generalized Ladenburg’s result, arguing that if the state i is 
not the ground state also all states k ' have to be taken into account for 
which E k > < E t so that the secondary waves can be described as originat¬ 
ing from a varying electric moment whose amplitude is given by the for¬ 
mula 




Ek' < Ei 



(4.13) 


where v ik = ( E k — E t )/h and v k>i =E k > — E t . In view of Eq. (4.11) 
with N/N t = f t and provided v ok in Eq. (4.6) for p k is replaced by v ik or 
v k ' i9 respectively, Eq. (4.13) can be written A = 2f k p k — ^f k >p k >. Hence, 
Kramers contended, “the reaction of the atom against the incident radi¬ 
ation can thus formally be compared with the action of a set of virtual 
harmonic oscillators inside the atom, conjugated with the different possi¬ 
ble transitions to other stationary states... in the final state of the transition 
the atom acts as a ‘positive virtual oscillator’ of relative strength + /; in 
the initial state it acts as a negative virtual oscillator of strength — f 
However unfamiliar this ‘negative dispersion’ might appear from the 
point of the classical theory, it may be noted that it exhibits a close analogy 
with the ‘negative absorption’ which was introduced by Einstein, in order 
to account for the law of temperature radiation on the basis of the quan¬ 
tum theory.” 151 

In accordance with our principle of using a notation as uniform as 
possible, our notation of the dispersion formula (4.13) differs from Kra- 
mers’s original formulation in so far as Kramers denoted the transition 
frequencies (which we called v ik and v k i ) by and v e k > , where re¬ 
ferred to transitions to higher states k (absorption) and v e k . to lower states 
k ' (emission), relative to the stationary state under consideration. 152 This 
difference in notation, irrelevant as it is from the mathematical point of 
view, seems nevertheless to have some importance from the point of view 
of our study of the conceptual development of the theory. For, as our 
notation clearly indicates, Kramers was already in a position to describe 
the dispersive behavior of the atom by a two-fold infinite set of virtual 
oscillators, an idea of basic importance for the future development of 
matrix mechanics. But instead of doing so, his attention, guided by his 
notation, was focused, so to speak, only on the one-fold infinite set of 
virtual oscillators relative to every given stationary state. In compliance 
with the correspondence principle the dispersion law could not be formu- 
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lated without recourse to the frequencies v ik and v k . i9 referring to initial 
and final stationary states. These are states which potentially can be occu¬ 
pied by the atom, but are not actually occupied. In other words, hypo¬ 
thetical possibilities had to be regarded as physically active efficient 
causes. In order to overcome this conceptual difficulty it had to be as¬ 
sumed that the virtual radiation field continues to be operative through¬ 
out the whole lifetime of a given stationary state, an assumption which, in 
turn, entailed the statistical interpretation of energy conservation as pre¬ 
viously discussed. 

Kramers obtained these results in the early spring of 1924. In June, 
1924, Bom completed his paper “On quantum mechanics,” 153 to which 
reference has been made at the end of the preceding chapter. In this article 
Bom tried to establish a method by which the classical perturbation theo¬ 
ry of multiply periodic nondegenerate systems could be applied to quan¬ 
tum phenomena which involve external periodic disturbances or internal 
couplings. So far, Bom argued, theories of polyelectronic systems, such as 
the helium atom, which treated the electronic interactions along classical 
lines, had to fail since electrons affect each other with frequencies of the 
order of those of light, but the interaction between matter and light is 
evidently a “nonmechanical” quantum process; one cannot expect, there¬ 
fore, that the interactions between electrons can be treated along classical 
lines either. Bom now suggested a solution of the problem by generalizing 
Kramers’s treatment of the interaction between radiation and electrons 
into a “quantum mechanics” of interactions. In carrying out this pro¬ 
gram, Bom showed that the transition from classical mechanics to what 
he called “quantum mechanics” can be obtained in accordance with 
Bohr’s correspondence principle if a certain differential is replaced by a 
corresponding difference. The quantum-theoretic frequency v n n _ r for a 
transition from a stationary state characterized by the quantum number n 
to a stationary state characterized by the quantum number n' = n — t, it 
will be recalled, coincides, according to Bohr’s correspondence principle, 
for large n and small r, with the classical frequency v(«,r), the rth har¬ 
monic (overtone) of the fundamental frequency of the classical motion in 
state n: 

V n,n-r = v(/l,r) =rv(n, 1) (4.14) 

v(n, 1) = v is the classical fundamental frequency and is equal to the de¬ 
rivative of the Hamiltonian (or total energy) with respect to the action 
variable, 154 v = dH/dJ. To prove Eq. (4.14) we note that for//as a func¬ 
tion of J —nh Bohr’s frequency condition states that 


v 


n,n — r 


H(nh) —H[(n — r)h ] 


dH 

= r-= tv. 

dJ 


h 
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Now, comparing 


with 


v(n,r) = r 


dH 

dJ 


t dH 
h dn 


(4.15) 


v n , n -T=\\.H{n) — H(n - r)]. 
h 

Bom recognized that the quantum-theoretic frequency v n n _ r is obtained 
from the classical frequency v(n 9 r) by substituting the difference 
H(n) — H(n — r) for the differential r(dH/dn). Generalizing this rela¬ 
tion, Born postulated 155 that for any arbitrary function <£(/*), defined for 
stationary states, the differential r[d<&(n)/dn] is to be replaced by the 
difference <£(/*) —— r) or symbolically 

^ d<P(n) (4.16) 

dn 

Since this recipe for translating classical formulas into their quantum- 
theoretic analogues will play an important role in the discovery of matrix 
mechanics, as we shall see in due course, we shall call it briefly “Bom’s 
correspondence rule,” indicating thereby its intimate relation to Bohr’s 
correspondence principle. For future reference we mention on this occa¬ 
sion that an iteration of Bom’s correspondence rule shows that 


for, by (4.15) 


dv{n 9 r) 

dn 


< > V — 1/ 

v n + r,n v n,n — r 


(4.17) 


dv(n,r) 1 d [ dH(n)' 

T -=- T - T --- 

dn h dn dn 

= -| ~{[H(n + r) - H(n)] - [H(n ) -tf(n-r)]}. 
h 

Werner Heisenberg, who spent the winter semester 1923/24 in Gottin¬ 
gen, assisted Bofn in this work 156 —and with great profit, as his subse¬ 
quent comprehensive study of scattering and dispersion, carried out to¬ 
gether with Kramers, clearly shows. 

Kramers and Heisenberg 157 considered a nondegenerate multiple peri¬ 
odic system whose motion can therefore be described in terms of angle and 
action variables w 9 J and whose undisturbed electric moment M can be 
represented by 



C T „... T/ exp[2ir/(r,u;, + • • • + r f w f ) ] 
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where w k = v k t + 8 k and where reality requirements impose on the J- 
dependent Fourier coefficients the conditions CL ^ =C* (theas- 
terisk denotes complex conjugate values). Using the abbreviations 

CJ = T,V l + ■■■+TfVf, co' = t;vj + ••• +T' f v f , T D k =T k + r' k , 


o) 0 = co + (o ', = <5 0 = 


^<§1 + • • • + 7 jSf, C T = C T . c fi 



and 


— =Yr — —=Yt' — 

SJ i Tk dJ k SJ' Y Tk dJ k 

they showed that the classical electric moment M cl of the system, if acted 
on by a perturbation E exp (lirivt), is given by the expression 

Af c , = Ref^exp{27ri[ ( co 0 + v)t + S 0 ]} 


u sc * c - 


-c.- 


v 


4 \ SJ ' (o' + v T dJ co' + v 


:)W 


(4.18) 


where, for the sake of simplicity, it is again assumed that the incident wave 
is linearly polarized with its electric vector parallel to the direction of Af, 
and where the first summation extends over all integral indices 
r x ,...,7y from — oo to + oo, and the second summation similarly over all 
rj,...,r}. If all r k are taken as equal, respectively, to — r' k , so that 
co = — co' and 8/8J' = — 8/8J, the terms obtained in (4.18) represent 
that part of M cl which vibrates in resonance with the incident radiation of 
frequency v. 158 This part, denoted by Af cl (v), is consequently given by the 
equation 


M cl (v) = Re 


exp(27r/vr) ^ 


= Re 


1 S( cl \ 
4 8J\co — v/ 

exp (2777V?) Y -LA.(Sj- + S1/\e 
T, 7?,Tr 4 SJ\co — v co + v) 


• Ty= — OO 


o>> 0 


Applying Bom’s correspondence rule to the classical dipole frequency co 
and to the differential operator 8/8J , Kramers and Heisenberg obtained 
the quantum-theoretic electrical moment of the atom in state /, 


Mj{v)= Re 


exp(27r/vO 



+ 


C ki 2 \ 

n, + v / 


0 4 h - V v ik . + V/. 


where v ki = (E k — E t )/h, v kl = — v ik and where the amplitude C ki of 
the dipole moment C ki exp ( 2mv ki t ) is connected with the Einstein proba¬ 
bility A ( * by the relation 159 
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Akt (2 fl - v *,) 4 

A; hvhi =-=- Cv; . 


3c 


ki 


(4.19) 


In virtue of Ladenburg’s equation (4.11) now written with the two indices 
k and /, 

r mc3 a k 

fk ‘ ~ &ir 2 e 2 v ki 2 1 

the following equation is obtained, 160 


e _ 2ir z m n 2 

J ki , j ^ki^ki 

he 

and the polarization in state / is given by the expression 


(4.20) 


a. = 


e 2 ( y fki y fik' \ 

4fl 2 m Ke^e, v ki 2 - v 2 eTke, v ik ' 2 1/2 / 


(4.21) 


Equation (4.21) is, of course, the quantum-theoretic translation of Eq. 
(4.8). But contrary to the f k in Eq. (4.8) which denoted the number of 
dispersion electrons of kind k and were therefore nonnegative integers, the 
“/numbers”/*, in Eq. (4.21) are not necessarily integers. They satisfy, 
however, an important equation, the so-called “/-summation theorem” 
(“/-Summensatz”), as Thomas 161 and Kuhn 162 have shown. For the sake 
of simplicity we shall give an outline of the proof of this theorem only for 
the case of a one-electron system with periodicity degree 1. In this case the 
dipole moment is given by 


oo 


M=\ C T exp[27nr(vf + 8) ] (4.22) 

T = — OO 

in terms of which the average kinetic energy can be calculated as follows: 


E km = ~ ~ f i(2*nv) 2 £2 tt'C t C. 

JO 


2 e 2e J jo T * 

Xexp[27n(r + r'){vt + 8)]dt 


r' 


_ oo 

or E kin = (m'n 2 v z /2e 2 ) ^ r z C T C_ T . Now, the action variable J 

T = — OO 


oo 


which 163 equals 2 E kin /v is therefore given by J = ( 2'n 2 m/e 2 ) X ^Cr 2 - 

r = 0 

Differentiation with respect to J = nh yields 1 = (2ir z m/ 


oo 

he 2 ) ^ r(d/dn)(rvC T 2 ) and application of Bom’s correspondence prin- 

r = 0 

ciple, with due consideration of Eq. (4.20), leads finally to the “/-summa¬ 
tion theorem” 


2/« - 



(4.23) 
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CHAPTER 5 


The Formation of Quantum 

Mechanics 


5.1 The Rise of Matrix Mechanics 

In spite of its high-sounding name and its successful solutions of numer¬ 
ous problems in atomic physics, quantum theory, and especially the quan¬ 
tum theory of poly electronic systems, prior to 1925, was, from the meth¬ 
odological point of view, a lamentable hodgepodge of hypotheses, 
principles, theorems, and computational recipes rather than a logical con¬ 
sistent theory. Every single quantum-theoretic problem had to be solved 
first in terms of classical physics; its classical solution had then to pass 
through the mysterious sieve of the quantum conditions or, as it happened 
in the majority of cases, the classical solution had to be translated into the 
language of quanta in conformance with the correspondence principle. 
Usually, the process of finding “the correct translation” was a matter of 
skillful guessing and intuition rather than of deductive and systematic 
reasoning. In fact, quantum theory became the subject of a special crafts¬ 
manship or even artistic technique which was cultivated to the highest 
possible degree of perfection in Gottingen and in Copenhagen. In short, 
quantum theory still lacked two essential characteristics of a full-fledged 
scientific theory, conceptual autonomy and logical consistency. 

The core of the difficulties was, of course, the fact that according to 
classical physics, which served as the point of departure for quantum- 
theoretic calculations as long as atomic systems were described in classical 
terms, the optical frequencies of spectral lines should coincide with the 
Fourier orbital frequencies of the system’s motions, a result not borne out 
by experiment. The discrepancy was smoothed over by Bohr’s heuristical- 
ly invaluable principle of correspondence. For it made it possible to retain 
the description of motion in terms of classical kinematics and dynamics 
but allowed at the same time a certain tailoring of the results so as to fit it 
to observational data. It was Heisenberg who recognized that this ap¬ 
proach is only one alternative. The other alternative, which he chose in his 
historic paper “On a quantum theoretical interpretation of kinematical 
and mechanical relations” 1 and which led to the development of matrix 
mechanics, the earliest formulation of modem quantum mechanics, aban- 


208 



Formation of Quantum Mechanics 


209 


doned Bohr’s description of motion in terms of classical physics alto¬ 
gether and replaced it by a description in terms of what Heisenberg re¬ 
garded as observable magnitudes. 

As the following analysis attempts to show, Heisenberg’s crucial inter¬ 
vention, if examined with respect to its epistemological tenets, seems to 
have been made under the influence of those intellectual currents which 
have been mentioned in our discussion on the philosophical background 
of nonclassical conceptions. Bohr, it will be recalled, 2 gave a series of 
lectures in 1922 at the University of Gottingen, whose influence upon 
Pauli has already been pointed out. These lectures were also attended by 
Heisenberg. In one of these lectures Bohr spoke on the quadratic Stark 
effect and expressed his confidence that the theory in its present formula¬ 
tion is on the right track in spite of the as yet unclarified contradictions 
and the impossibility of calculating exactly the intensities of spectral lines. 
In particular, Bohr declared, the great success in explaining the linear 
Stark effect on the basis of the quantum conditions made him feel sure that 
the proposed interpretation of the quadratic effect must also be correct. 
Heisenberg, who had already begun his study of the problem of disper¬ 
sion, differed with Bohr on this issue. Recognizing that the quadratic 
Stark effect may be considered as a limiting case of dispersion for incident 
radiation of infinitely low frequency and realizing that a quantum-theore¬ 
tic treatment of dispersion cannot be worked out along the lines proposed, 
Heisenberg criticized Bohr’s statement. Although Bohr, as Heisenberg 
recalls, 3 discussed with him privately, after the lecture, this problem and 
related questions for over three hours, Heisenberg did not retract his 
challenge. On the contrary, Bohr’s frequent remarks that “the experimen¬ 
tal situation has to be covered by means of concepts which fit”—in 
consonance with the Kierkegaard-Hdffding insistence that every field of 
experience required its own conceptions and principles and, interestingly, 
also in consonance with the philosophies of Comte and Boutroux—seem 
to have impressed the young Heisenberg strongly and encouraged him in 
his search for an alternative approach. Heisenberg admitted repeatedly 
that it had been Bohr’s influence and thus, ultimately, the philosophical 
presumptions underlying Bohr’s conception of physics which had sug¬ 
gested to him the idea that the prevailing conceptual apparatus was not a 
categorical necessity or an indubitable catechism. 

Thus if Heisenberg’s conception of the very possibility of rejecting the 
description of atomic systems in terms of classical physics may be traced 
back, via Bohr, to one of the schools of philosophical thought referred to 
previously, his choice of the particular nature of the new concepts by 
which he replaced the classical ones goes back to the other philosophical 
trend mentioned, the positivism or logical empiricism of the early twen¬ 
ties. Interested in philosophy while still a student at the classical “gymna- 
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sium”—his first acquaintance with atomic theory was through Plato’s 
Timaeus —he was strongly impressed, first by Kant’s Critique of Pure 
Reason and later particularly by Wittgenstein’s writings. But what most 
appealed to his mind was Einstein’s treatment of the concept of time, his 
replacement of Lorentz’s “local” or “mathematical” time in the so-called 
Lorentz transformations 4 by the operationally defined and, in this sense 
observable, relativistic time. Einstein’s rejection of the Newtonian, opera¬ 
tionally undefinable, conception of simultaneity of spatially separated 
events and his relativistic reinterpretation of the Lorentz-FitzGerald con¬ 
traction, previously regarded as an effect unobservable in principle, un¬ 
doubtedly made a strong impression on Heisenberg. It is known that both 
in Munich, where Heisenberg had written his thesis under the guidance of 
Sommerfeld, and in Gottingen, where he was working under Bom and 
where Minkowski lectured, relativity was studied with great fervor. 

The preceding remarks will make it clear why Heisenberg insisted on 
using observable magnitudes as terms for the description of atomic states. 
From the modem sophisticated point of view a distinction between ob¬ 
servable and theoretically inferred magnitudes poses, of course, a highly 
complicated problem. In fact, when in 1926 Heisenberg confided to Ein¬ 
stein that “the idea of observable quantities was actually taken from his 
relativity,” 5 Einstein already pointed out that it is the theory which ulti¬ 
mately decides what can be observed and what cannot. One may admit, 
however, that such a distinction—even if it is, prior to the establishment of 
a theory, unwarranted—can be adopted as merely a heuristic principle. 

Although the explicit formulation of these deliberations was, of course, 
of a later date, it was precisely this heuristic approach which Heisenberg 
adopted, as the following discussion tries to show. Heisenberg rejected the 
classical notions of position and velocity or momentum of electrons in 
atoms not only because they were unobservable in the sense that, so far, 
nobody had measured them directly. For it may always be argued, as 
Heisenberg explicitly admitted, that future progress in experimental tech¬ 
niques would eventually make it possible to measure these quantities. 
“This hope,” continued Heisenberg—and this is the essential point— 
“could be regarded as justified if the formal theory [in terms of which 
these quantities are calculated] can consistently be applied to a distinctly 
defined domain of quantum-theoretic problems. Experience, however, 
shows that...the reaction of atoms to periodically varying fields cannot be 
described by these rules nor is their application to polyelectronic systems 
feasible.” 6 Thus, Heisenberg’s rejection of these quantities as unobserva¬ 
ble was based on two empirical facts, the experimental impossibility of 
directly measuring them and the practical failure of a theory which as¬ 
sumed them to be observable. He replaced these quantities of classical 
kinematics by the optical quantities of frequency and intensity, or rather 
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dipole amplitude, 7 and investigated whether a theory, assuming these as 
observable, can be worked out consistently. Basically, Heisenberg’s atti¬ 
tude, in this respect, resembled that of Einstein, for whom the concept of 
Newtonian time had lost its physical significance not only, as he showed in 
his analysis of the simultaneity of spatially separated events, because of its 
insusceptibility to operational determination but also because classical 
physics which assumed this concept as observable conflicted with experi¬ 
ence. 

A second fundamental innovation in Heisenberg’s approach was the 
way he employed Bohr’s correspondence principle. As mentioned before, 
the translation of classical formulas into the language of quantum theory 
usually required a great deal of guessing and had only one guideline, the 
correspondence principle. It had to be used in a separate way for almost 
every problem, the particular mode of its application depending on the 
specific data of the problem. While its versatility and fertility made it 
attractive to the more synthetically minded physicists, such as Bohr, its 
flexibility and lack of rigidity made it repugnant to the more analytically 
oriented theoreticians, such as Sommerfeld. Heisenberg, influenced by 
both Sommerfeld and Bohr, considered now the possibility of “guessing” 8 
—in accordance with the correspondence principle—not the solution of a 
particular quantum-theoretic problem but the very mathematical scheme 
for a new theory of mechanics. By integrating in this way the correspon¬ 
dence principle once and for all in the very foundations of the theory, he 
expected to eliminate the necessity of its recurrent application to every 
problem individually without jeopardizing its general validity. Through 
absorbing, so to speak, the correspondence in the foundations of the theo¬ 
ry, it would be possible, Heisenberg hoped, to solve quantum-theoretic 
problems in a mathematically rigorous way without losing the effective¬ 
ness of the principle. In the following analysis of what Heisenberg called 
“the kinematics of quantum theory” we shall see how skillfully he em¬ 
ployed Bohr’s principle and how he reduced the “guess” to an “almost 
cogent” 9 procedure. 

In classical physics any time-dependent quantity £ n = (t) can be 

represented by a Fourier expansion 10 

in = I i(n,T) = X x(n,T)exp[2iriv(n,r)t ] (5.1) 

r r 

where the 7 th Fourier component |’(/i,r) has the amplitude x(n,r) and 
the frequency v(n,r) = rv(«,l) = tv. Now, just as according to Bohr’s 
correspondence principle the quantum-theoretic frequency v n n _ T corre¬ 
sponds to v(n,r), Heisenberg assumed that a quantum-theoretic quantity 
x nn _ T corresponds to the Fourier amplitude x(n,r) or, symbolically, as 
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v(n,r)++v nn _ T (5.2) 

so also 


x(n,T)++x nn _ T (5.3) 

From their definition or from the reality requirement it follows that 


v(n, — r) = — v(n,r) 
x(n, — r) =x*(n,r ) 

^n — r,n ^n,n — r 



v(n,r) + v(n,r') = v(n,r + r') 

— r — r.fl — r 1 — y 


(5.4) 

(5.5) 

(5.6) 

(5.7) 

(5.8) 

(5.9) 


Heisenberg realized that in spite of the correspondences (5.2) and 


(5.3) an expression of the kind Y x nn _ T exp (2iriv nn _ T t), analogous to 

T 

(5.1), has no physical significance whatever, if only because of the essen¬ 
tially equal status of the two indices in v n n _ T . 

Heisenberg, however, assumed that the “set” (“Gesamtheit”) 
{x n n _ T exp(27nv n „ _ T t)}, or briefly {g„ t „_ T }, may well be chosen as the 
quantum-theoretic representative of the classical quantity £ n (t): 

£n++{x n ,n-T exp (2iriv n n _ T t)} (5.10) 

In classical physics, now, 


£ 


2 

n 


^ x ( «,t' ) exp [ 2mv( n,r') t ] 


or 


X 


X x(n,r 


r') exp [ 2mv{ n,r — r')t] 


(5.11) 


S 


2 

n 


r l / 


Xexp[2jr/v(n,r')t ]exp[2 iriv(n,r — r')t ] 
so that in virtue of (5.8) 


(5.12) 


£ 2 „ = '£x {2 \n,T)exp[2iriv(n,T)t] (5.13) 

T 

where 


x {2) (n 9 r) =^x(n,T')x(n,r — r’) 


( 5 . 14 ) 
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Heisenberg now asked, “How has the quantity £ \ to be represented in 
quantum theory?” In other words, if—in agreement with (5.10)— 

£ 2 n++ix™_ T exp(2fnV n> „_ T 0> (515) 

what is 

A straightforward replacement of the classical quantities in (5.12) by 
the quantum-theoretic quantities in accordance with (5.2) and (5.3) 
would yield 


exp(2i7iV„„ _ y t) exp( 2iriv„ n _ iT _ 7 , y t) (5.16) 

Y 

for the 7th term in the set representing £ \. Since the product of the two 
phase factors in (5.16) differs from exp(2mv nn _ r t), the phase factor 
required by (5.15) but exp ( 2mv n n _ Y t) exp( 2mv n _ Yn _ T t), in virtue of 
(5.9), does yield the required factor, Heisenberg postulated—and this is 
what he called an “almost cogent” conclusion—that (5.16) has to be 
replaced by 

X exp(2inV„ in _ / r)exp(2OTV n _ T , j „_ T 0 (5.17) 

Y 

which is, of course, equal to 

(X - T- r)exp(27riv„ >n _ (5.18) 

so that 


X 


( 2 ) 

rt,n — r 



Y 


X 


n — Y,n — r 


(5.19) 


is the required multiplication rule. 

He immediately generalized (5.19) for and for the product of two 
quantities, such as where r) n ++{y n ,„- T exp(2i7iv n „_ T r), and 

found that, in general, g n T}„ since, in general, X r 

Y 

7 ^ X y * 1 '* 1 -Y x n-Y,n-r • The new kind of multiplication turned out to be 

Y 

non-commutative. 

Having completed the “kinematics,” Heisenberg proceeded in the sec¬ 
ond part of his paper to the discussion of the “mechanics” of his new 
formalism. For the sake of simplicity only linear problems are considered. 

In the older theory the solving of a dynamical problem consisted of two 
steps: the equation of motion mx + f(x) = 0 had to be solved and the 
quantum condition J = §p dq = fmx 2 dt = nh had to be satisfied. With 
the help of (5.2), (5.3), and the new multiplication rule it was not difficult 



214 


Conceptual Development of Quantum Mechanics 


to transform the equation of motion into relations which had to be satis¬ 
fied by the quantum-theoretic amplitudes x n n _ T . For the determination 
of these “transition amplitudes” the quantum-theoretic version of the 
quantum condition had to be taken into account. To obtain the latter in 
terms of the new formalism, Heisenberg showed by a straightforward 
calculation that in virtue of (5.1) and (5.8) the quantum condition is 

J *l/v 

mx 2 dt = 4 / n 2 m Y rv(n,r) \x(n,r)\ 2 = nh 

0 r 

or, if differentiated with respect to n, 


h = 4-n 2 m Y r-^- (v(«,r)|x(n,r)| 2 ) 
t an 

Applying the correspondence rule (4.17), page 196, or rather its general¬ 
ization as used in the Kramers-Heisenberg theory of dispersion, 


d&(n,r) 

T - 

dn 




n -|- r,n 



n — t 


Heisenberg obtained 


h = 4v 2 m^{\x tt + T- „\ 2 v„ + r<n - K„_ r | 2 v„ >n _ T > (5.20) 


or 


00 


h = iu 2 m 2"tK + r,J 2 v„ + T , n - K„_ r | 2 v n ,„_ T > (5.21) 


r = 0 


which by Eq. (4.20), is precisely the Thomas-Kuhn “/-summation 
theorem” (4.23). 

In the last part of his paper Heisenberg applied his theory to the anhar- 
monic oscillator, classically described by x + <o%x + Ax 2 = 0—the prob¬ 
lem of the harmonic oscillator would not have required the application of 
the new multiplication rule—and found that its energy W, up to the order 
of magnitude A 2 , is given by the expression 


W=^2.(n + \) (5.22) 

2l7 

Equation (5.22) implied that the energy of the harmonic oscillator 
(>l = 0) is likewise given by the same expression in contrast to the result of 
the older quantum theory W = nhv 0 . Heisenberg applied his new formal¬ 
ism to the rotator, that is, to an electron revolving at constant distance a 
and with constant angular velocity co around the nucleus. From Eq. 
(5.21) and the assumption that at the ground state no radiation is emitted, 
he obtained 
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n,n — 1 


h 

4 T^rna 1 


(5.23) 


and from Eq. (5.19) for its energy W 

W=ma 2 77 2 (v 2 n _ n _ 1 + v^ + i, n ) - g J ma2 (« 2 + n + (5.24) 

in agreement with Kratzer’s observations. 11 Yet in spite of these promis¬ 
ing results and their support by experience, Heisenberg concluded his 
article with the remarks: “Whether a method of determining quantum- 
theoretic data in terms of relations between observable magnitudes as at 
present proposed can be regarded as already satisfactory in principle or 
whether it is still too crude an approach to the obviously highly complicat¬ 
ed problem of a quantum-theoretic mechanics, will become clear only 
after the presently rather superficially applied procedures will have been 
subjected to a penetrating mathematical investigation.” 12 

The above-mentioned solution of the anharmonic oscillator had been 
obtained by Heisenberg at the end of May, 1925, while he was recuperat¬ 
ing on the island of Heligoland from a heavy attack of hay fever. In June 
he returned to Gottingen and accepted an invitation to lecture at the 
Cavendish Laboratory in Cambridge. Still doubtful whether his paper 
made sense and determined “either to finish it during the few remaining 
days prior to his departure or to throw it into the flames,” he sent it to 
Pauli, asking him to return it with his comments after two or three days. 
Pauli’s reply was encouraging. Thus, in the middle of July Heisenberg 
presented the paper to Bom, who immediately recognized its importance. 
Bom sent it to the editor of the Zeitschrift fur Physik, who received it on 
July 29. 

The striking feature in Heisenberg’s paper was, of course, its represen¬ 
tation of physical quantities by “sets” of time-dependent complex 
numbers and the peculiar multiplication rule. Even Bom was puzzled. 
“Heisenberg’s multiplication rule,” Bom reminisced in his Nobel Prize 
Address, 13 “did not give me any rest, and after eight days of concentrated 
thinking and testing I recalled an algebraic theory which I had learnt from 
my teacher Professor Rosanes in Breslau.” Bom recognized that Heisen¬ 
berg’s “sets” were precisely those matrices with which he had become 
acquainted when in 1903, as a young student at the University of Breslau, 
he attended lectures on algebra and analytical geometry given by Jacob 
Rosanes, 14 a pupil of Frobenius. 15 

Heisenberg’s strange multiplication rule turned out to be simply the 
rule “row times column” for the multiplication of matrices, or in symbols, 
if x denotes the matrix (x mn ) and y the matrix (y mn ), then the element in 
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the mth row and nth column of the product of these two matrices is given 
by the formula 


(Xy)mn = X X mk y«n 

k 

a rule which had been formulated for the first time, though only for finite 
square matrices, seventy years earlier by Arthur Cayley. 16 

In connection with our present discussion of Heisenberg’s matrix rep¬ 
resentation of quantum-mechanical states, the following mathematical 
digression seems to be of some interest. As is well known, subsequent 
formalisms of quantum mechanics employed—in addition to matrices 
and apart from complex-valued functions (Schrodinger wave func¬ 
tions)—also Hilbert space vectors and, after the advent of the quantum 
mechanics of spinning particles, quaternions as well. 17 The striking point, 
now, is the fact that precisely these three kinds of mathematical entities, 
matrices, multidimensional vectors, and quaternions, happened to be 
equally involved in a historically interesting controversy concerning the 
priority of the discovery of matrices. Sylvester 18 credited Cayley as the 
exclusive discoverer of matrices, calling the latter’s paper of 1858 “a sec¬ 
ond birth of algebra, its avatar in a new and glorified form.” Seven years 
later Gibbs criticized Sylvester severely for this statement and, referring 
to Grassmann’s Ausdehnungslehre , 19 claimed that “the key to the theory 
of matrices is certainly given in the first ‘Ausdehnungslehre’; and if we call 
the birth of matricular analysis the second birth of algebra, we can give no 
later date to this event than the memorable year of 1844.” 20 Grassmann, 
as we shall see later on, anticipated a great deal of the theory of linear 
vector spaces. Now, Gibbs regarded 1844 as a “memorable’ 4 year in the 
annals of mathematics because it was the year of the appearance of Hamil¬ 
ton’s first paper on quaternions. 21 But Hamilton has also been named the 
discoverer of matrices. The first to do so was Tait. In a letter to Cayley, 
who to his bad luck had shown that quaternions can be represented as 
matrices of order 2 with complex elements, Tait, referring to Hamilton’s 
“linear and vector operators,” called Cayley’s discovery only a modifica¬ 
tion of Hamilton’s ideas. Whereupon Cayley retorted: “I certainly did not 
get the notion of a matrix in any way through quaternions....” 22 The claim 
was renewed in 1890 by Taber, 23 who declared: “Hamilton must be re¬ 
garded as the originator of the theory of matrices, as he was the first to 
show that the symbol of a linear transformation might be made the sub¬ 
ject-matter of a calculus.” 24 To give further details on this controversy, 
which involved W. K. Clifford, A. N. Whitehead, A. Buchheim, and in 
the twentieth century T. J. I’A. Bromwich, who preferred to be “unable to 
give any opinion on this matter,” 25 would lead us too far astray from our 
subject. 
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Now, matrices, multidimensional vectors, and quaternions are exten¬ 
sions of the concept of real numbers. Beyond the domain of complex 
numbers, however, extensions are possible only at the expense of HankePs 
principle of permanence, 26 according to which generalized entities should 
satisfy the rules of calculation pertaining to the original mathematical 
entities from which they have been abstracted. Thus, while associativity 
and distributivity could be preserved, commutativity had to be sacrificed. 
It was the price which had to be paid to obtain the appropriate mathemat¬ 
ical apparatus for the description of atomic states. 

The mathematical process shows a certain analogy to a fundamental 
epistemological principle which Heisenberg once expressed as follows: 
“Almost every progress in science has been paid for by a sacrifice, for 
almost every new intellectual achievement previous positions and concep¬ 
tions had to be given up. Thus, in a way, the increase of knowledge and 
insight diminishes continually the scientist’s claim on ‘understanding’ na¬ 
ture.” 27 

Prior to 1925 matrices had rarely been used by physicists. Notable 
exceptions were Mie’s nonlinear electrodynamics 28 and, interestingly, 
Bom’s work on the lattice theory of crystals. 29 In these exceptional cases 
matrices were even subjected to algebraic operations and among them to 
multiplication. But on the whole, matrices were regarded as belonging to 
the realm of pure mathematics. 

In retrospect, it seems almost uncanny how mathematics now prepared 
itself for its future service to quantum mechanics. Bocher’s Introduction to 
Higher Algebra, 30 the standard textbook on matrix theory, appeared in a 
German 31 translation in 1910, one year after the publication of Kowa- 
lewski’s treatise on determinants, the first text to include infinite determi¬ 
nants. 32 Its second and revised edition was completed in May, 1924. At 
the same time Courant, 33 utilizing Hilbert’s lectures, finished in Gottin¬ 
gen the first volume of the well-known Methods of Mathematical Physics. 
Published at the end of 1924, it contained precisely those parts of algebra 
and analysis on which the later development of quantum mechanics had 
to be based; its merits for the subsequent rapid growth of our theory can 
hardly be exaggerated. 

One of Courant’s assistants in the preparation of this work was Pascual 
Jordan. 34 Having had to familiarize himself for this purpose with the theo¬ 
ry of matrices, Jordan made a thorough study of Bocher’s exposition 35 as 
well as of other publications on this subject and acquired great skill in the 
manipulation of matrices. 36 

At the same time Bom was looking for a qualified assistant for the 
project of putting the new matrix mechanics on logically firm founda¬ 
tions. He turned at first to Pauli. Although, as we shall presently see, Pauli 
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was soon to make an important contribution to matrix mechanics, he 
rejected Bom’s proposal. It was not easy to find a competent assistant for 
this project. Physicists either did not know 37 what matrices were or, if they 
did, were reluctant to apply them to theoretical problems. This aversion 
probably explains why Heisenberg’s historic paper of 1925 had not been 
reviewed in the Physikalische Berichte, the official Abstracts of the Ger¬ 
man Physical Society, prior to the 1927 issue, and there only in a one- 
sentence statement written by one of its editors. 38 The report given in 1926 
by Science Abstracts , 39 issued by the Institute of Electrical Engineers in 
association with the Physical Society of London and with the American 
Physical Society, was not much more detailed. 

It should be noted, however, that the reluctance with which Heisen¬ 
berg’s matrix mechanics was met has to be attributed not only to the 
unaccustomed mathematics involved but also, at least in part, to the unfa¬ 
miliarity of its conceptual background. Thus, for example, Segre 40 reports 
that Fermi, having been awarded, in 1923, a fellowship by the Italian 
Ministry of Education (Ministero delPIstruzione Pubblica) to study 
abroad, chose Gottingen to work with Bom but left it soon for Leyden. 
Segre continues: “Heisenberg’s great paper on matrix mechanics of 1925 
did not appear sufficiently clear to Fermi, who reached a full understand¬ 
ing of quantum mechanics only later through Schrodinger’s wave me¬ 
chanics. I want to emphasize that this attitude of Fermi was certainly not 
due to the mathematical difficulties and novelty of matrix algebra; for 
him, such difficulties were minor obstacles; it was rather the physical 
ideas underlying these papers which were alien to him.” Obviously, the 
abstract, almost philosophical, nature of Heisenberg’s paper and its ap¬ 
parent lack of immediate practical applications made it repugnant to the 
more experimentally inclined physicists. But precisely the same qualities, 
viewed as initiating a new era of progress in theoretical physics, appealed 
to the more philosophically minded theoreticians—as far as they took 
notice at all. Thus Bohr, at the sixth Scandinavian Mathematical Con¬ 
gress, held in Copenhagen at the end of August, 1925, only a few weeks 
after the completion of Heisenberg’s paper, acclaimed it as an outstanding 
achievement. To him, of course, there was nothing “alien” in it. On the 
contrary, he declared: “The whole apparatus of quantum mechanics can 
be regarded as a precise formulation of the tendencies embodied in the 
correspondence principle.” 41 Concluding his address, he remarked: “It is 
to be hoped that a new era of mutual stimulation of mechanics and math¬ 
ematics has commenced.” 

Now it happened that Bom, while traveling by train to Hanover, told a 
colleague of his from Gottingen about the fast progress in his work but 
also mentioned the peculiar difficulties involved in the calculations with 
matrices. It was fortunate and almost an act of providence that Jordan, 
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who shared the same compartment in the train, overheard this piece of 
conversation. At the station in Hanover Jordan then introduced himself 
to Bom, told him of his experience in handling matrices, and expressed his 
readiness to assist Bom in his work. 42 This, then, was the beginning of a 
fruitful collaboration which led to the publication of the fundamental 
Bom-Jordan paper “On quantum mechanics,” 43 the first rigorous formu¬ 
lation of matrix mechanics. It was received by the editor of the Zeitschrift 
furPhysik on September 27,1925, exactly sixty days after the Heisenberg 
paper. 

The Bom-Jordan paper contains four chapters, the first of which ex¬ 
pounds the pertinent theorems of matrix theory. The second chapter de¬ 
velops the foundations of quantum dynamics for nondegenerate systems 
with one degree of freedom. Representing the classical coordinate q (pre¬ 
viously denoted by x) by a matrix q = [q mn exp(2mv mn t) ] and the mo¬ 
mentum p by a matrix p = [p mm exp(2mv mn t) ], Bom showed that the 
extremalization of the trace of the matrix 

L = pq — H(pq) 

whereq= [ 2mv mn q mn exp(2iriv mn t)] and where H(pq) is the (matrix) 
Hamiltonian function of the matrices p and q, leads to the canonical equa¬ 
tions 

q = d H/<9p p= — d H/dq 

Representing both q and p independently as matrices, and not only q as 
Heisenberg did, Bom established for the first time what was later called 
the basic commutation relation in quantum mechanics. Bom’s derivation 
was based on the correspondence principle. Starting from the quantum 
condition of the older quantum theory, 

nh=J=(^>pdq = j pq dt 

and expanding p and q into Fourier series, 

p — ^p(n 9 r)exp(2mv(n 9 r)t) q = ^ q(n 9 r)exp(2mv(n 9 r)t) 

T T 

Bom obtained by simple integration 

nh = — 2m^rp(n 9 T)q*(n 9 r) 

T 

where use has been made of Eq. (5.5). From the last equation Bom de¬ 
duced 


h 

2m 



d 

T - 

dn 


( p(n 9 r)q*(n 9 T)) 
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which he expressed in the language of quantum mechanics in accordance 
with the translation rule as 

— = — y,ip(n + T,n)q*{n + r,n) —p(n,n — r)q*(n,n — r)) 

2m r 

or by Eq. (5.7), page 212, as 

— = Y(p(n,n - r)q(n — r,n) — q(n,n + r)p(n + r 9 n )) 

2m r 

Having thus proved the validity of the matrix equation 


pq —qp = 



(5.25) 


as far as diagonal elements are concerned (1 is, of course, the unit matrix) 
Bom conjectured that it holds also for the nondiagonal elements. That 
this is indeed the case was shown by Jordan on the basis of the canonical 
equations. First it was shown that a matrix with a vanishing time deriva¬ 
tive is diagonal. Calculating the time derivative of the matrix d = pq — qp, 
Jordan found in virtue of the canonical equations that d = 0, which, as 
just mentioned, entails that all nondiagonal elements are zero. The funda¬ 
mental quantization rule (5.25) has thus been rigorously proved. The 
equations of motion have shown that pq—qp is a diagonal matrix. That all 
diagonal elements are equal to h /2m has been a consequence of the corre¬ 
spondence principle. Moreover, since (5.25), as Bom soon realized, is the 
only fundamental equation in which h appears, the introduction of 
Planck’s constant into quantum mechanics has likewise been a conse¬ 
quence of the correspondence principle. In the present paper Bom and 
Jordan called this equation the “exact quantum condition” and already 
recognized its axiomatic status within the logical structure of the theory. 44 
“I shall never forget the thrill I experienced when I succeeded in condens¬ 
ing Heisenberg’s ideas on quantum conditions in the mysterious equation 
PQ — QP = h /2m” declared Bom 45 in reviewing his work of 1925. 

In the same chapter of their common paper Bom and Jordan also 
proved the energy theorem and Bohr’s frequency condition for the case 
H = H x {p) + H 2 {q) and showed how these conclusions have to be gener¬ 
alized for the case H = H(p,q). 

In the third chapter the theory is applied to the harmonic oscillator and 
Heisenberg’s results are retrieved. In contrast to the latter, however, who 
still needed the correspondence principle to show that transitions occur 
only between adjacent stationary states, 46 Bom and Jordan derived this 
restriction as a logical conclusion from their theory alone. Following a 
comprehensive treatment of the anharmonic oscillator, a discussion of the 
quantization of the electromagnetic field concluded the paper. 
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Having completed the paper, Bom left for a vacation in the Engadine. 
After his return from Switzerland in September he resumed his work with 
Jordan and, by correspondence, also with Heisenberg. In the middle of 
November the famous Bom-Heisenberg-Jordan paper “On quantum me¬ 
chanics IF* 47 was completed. Considered as a sequel to the Bom-Jordan 
paper, 48 it generalized the results to systems with an arbitrary finite num¬ 
ber of degrees of freedom, introduced canonical transformations, laid the 
foundations for a quantum-mechanical theory of time-independent as 
well as time-dependent perturbations, with the inclusion of degenerate 
cases, and discussed the treatment of angular momenta, intensities, and 
selection rules from the viewpoint of matrix mechanics. But above all it 
established a logically consistent general method for solving quantum- 
mechanical problems and, in addition, uncovered the connection between 
the mathematical apparatus of matrix mechanics and certain develop¬ 
ments in ordinary algebra and analysis. As this short description indi¬ 
cates, the “three-man paper” (“Drei-Manner-Arbeit”), as Bom likes to 
call it, was the first comprehensive exposition of the foundations of mod¬ 
em quantum mechanics in its matrix formulation. From our point of view 
the last two items in the preceding outline deserve special attention. For 
the sake of simplicity our analysis of these features will be confined to the 
discussion of dynamical systems with only one degree of freedom. 

Postulating the validity of what is called at first the “fundamental 
quantum-mechanical relation” 49 and later “the commutation relation” 50 

pq —qp = -A-l (5.26) 

2m 


Bom, Heisenberg, and Jordan show that for any function f(pq) which can 
be expanded as a power series in p and q, the following equations hold: 


fq - qf = 


pf - fp = 


h di 


2m dp 
h di 


(5.27a) 


(5.27b) 


2m dq 

since (5.27) hold for q, p, as well as for their sums and products. 

By introducing the energy function or Hamiltonian H (p,q), in analogy 
to the classical Hamiltonian of the system, and the corresponding canoni¬ 
cal equations of motion 


P 


d H d H 

— q = — 

dq dp 


(5.28) 


they derive the energy theorem and Bohr’s frequency condition as follows: 
Since Eqs. (5.27) hold with f=H, p= — (2m/h) (pH — Hp), and 
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q = — ( 2m/h ) ( qH — Hq ) and hence, by a reasoning similar to that used 
in the proof of (5.27), 

f = - —(fH-Hf) (5.29) 

h 

If f = H, Eq. (5.29) yields H = 0, which proves the energy theorem and 
shows that H nm = W n S nm , where the time-independent diagonal element 
W n is interpreted as the energy of the system in the wth stationary state. 
Bohr’s frequency condition v nm = ( W n — W m )/h follows immediately 
from (5.29) for f = q. 

As Bohr, Heisenberg, and Jordan continue to explain, the (almost) 
converse reasoning of the above is also valid, that is, from the assumption 
H = 0 follows (5.28) in the following sense: If two Hermitian matrices p° 
and q° (which we assume as being composed of time-independent ele¬ 
ments) can be found which satisfy the commutation relation 

p°q° — q°p° = —— 1 (5.30) 

2m 

then 


P = (P°nm exp(2 mv nm t)) q = (q° nm exp(2 mv nm t)) 

where v mn = (H nn — H mm )/h 9 satisfy Eqs. (5.28) and the commutation 
relation. This can be proved as follows. It is easily verified that for an 
arbitrary function f(pq) 

(fnm (PQ)) = (/„ m (p°q 0 )exp(2j7-/v nm O) (5.31) 

Substituting in (5.31) for f (pq) the function pq — qp, one deduces imme¬ 
diately from Eq. (5.30) that p and q satisfy Eq. (5.26). Hence (5.27) can 
be applied and 

(^P P^)nm 2‘77'ZV nmPrim Pnm 

(and similarly for q nm ), which proves the validity of Eq. (5.28). 

In consideration of these results the three authors could formulate the 
process of solving the canonical equations of motion as follows: Find two 
(time-independent) Hermitian matrices p° and q° which satisfy the com¬ 
mutation relation and in terms of which the Hamiltonian of the system is a 
diagonal matrix. 

By introducing the concept of canonical transformations, Bom, Hei¬ 
senberg, and Jordan reduced the process of solving the equations of mo¬ 
tion and, in particular, of finding the energy values H nn to certain prob¬ 
lems in algebra and analysis for which—though under somewhat more 
restrictive conditions—solutions were available or were the object of con¬ 
temporary mathematical research. They defined a transformation of the 
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variables p, q to P, Q as “canonical” if the commutation relation remains 
invariant, that is, if with pq — qp = ( h /2m)l also PQ — QP 
= (h /2m) 1. As shown by Eqs. (5.27), under canonical transformations 
the canonical equations of motion are invariant. It is also easy to see that 
with an arbitrary matrix U 

P = U _1 pU Q = U _1 qU (5.32) 

is a canonical transformation and that for any function f(p,q) 

f(P,Q) = f(U -1 pU,U -1 qU) = U-'f^qJU (5.33) 

The last equation made it possible for Bom, Heisenberg, and Jordan to 
reformulate the problem as follows (in view of the preceding consider¬ 
ations all matrices in the following discussion can be assumed to be com¬ 
posed of time-independent elements): If p and q are given which satisfy 
the commutation relation, find a canonical transformation such that 

H(P,Q) = H(U _1 pU,U _1 qU) = U" ! H(p,q)U = W (5.34) 

is a diagonal matrix. 

In the third chapter 51 of their paper Bom, Heisenberg, and Jordan 
succeeded in reducing this process, as reformulated, to a familiar math¬ 
ematical procedure. For this purpose they associated with every Hermi- 

tian matrix a = ( a nm ) the bilinear form A ( x,z ) = ^ a nm x n z m such that 

nm 

A ( x,x *) is real for arbitrary x l9 x 2 , .... The transformation x n = ^ 

j 

with the matrix u* = (i/*) transforms A(x,x*) into B(y,y*) 
= ^ b kj y k yf , where b = ( b kj ) = u*au. The matrix u is defined as “or- 

*3 

thogonal” if it leaves the Hermitian unit form E(x t x*) = 2 x n x* invar¬ 
iant. 52 It is shown that the necessary and sufficient condition for this 
property is 

u = u _1 (5.35) 

For finite matrices, it is now pointed out, there always exists an “or¬ 
thogonal” transformation “to principal axes” such that A(jc,x*) 

= ^ W n y n y* or > equivalently, there always exists a matrix u which satis¬ 
fies u* = u~ 1 and 

u _1 au = W = ( W n 8 nm ) (5.36) 

Bom, Heisenberg, and Jordan now refer to the algebraic methods and 
their elaborations in the Methoden der mathematischen Physik by Cour- 
ant and Hilbert 52a by means of which the W n in Eq. (5.36) can be calculat¬ 
ed in terms of the a nm without the need of actually performing the trans- 
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formation. In the same way, they contend, the diagonal elements W n of 
the diagonal matrix W in (5.34) can be computed in terms of the elements 
of the (in general, not diagonal) matrix H (p,q). The calculation of energy 
values has thus been reduced to the solution of the eigenvalue problem of 
Hermitian matrices in linear algebra. Unfortunately, however, as evi¬ 
denced by Eq. (5.26), quantum-mechanical matrices are in general not of 
finite order, nor are they even bounded. It had therefore to be assumed 
that the procedure, which holds for the finite case, remains valid also for 
infinite matrices if, in addition to the discrete spectrum of eigenvalues (or 
“point spectrum”), allowance is made for a continuous spectrum of eigen¬ 
values (or “line spectrum”), in accordance with the results obtained by 
Hilbert and Hellinger. 

As Bom and his collaborators pointed out, Eq. (5.34) is analogous to 
the Hamilton-Jacobi differential equation in classical physics and the ma¬ 
trix U corresponds to the action function. Equation (5.34) or HU = UW, 
treated as an equation of finite matrices, implies that for every m 

2(H mk -W n S mk )U kn =0 (5.37) 

k 

which shows that the eigenvalues W n are the roots of the secular (or 
characteristic) equation 


H-/ll|=0 (5.38) 

Since for each W n the U kn can be computed, the transformation matrix U 
can be determined. 

As our following brief discussion will show, the origin and develop¬ 
ment of the mathematical apparatus under consideration were intimately 
connected with the techniques of computing planetary motions in astron¬ 
omy. In fact, just like the mathematics of the older quantum theory and, in 
particular, the mathematics of its theory of perturbations, the mathemat¬ 
ics of matrix mechanics can be traced back to the study of planetary orbits 
and of the orbits of planetary satellites. Ironically, matrix mechanics, the 
outcome of Heisenberg’s categorical rejection of orbits, had eventually to 
resort to the mathematics of orbital motions. The very name of Eq. (5.38) 
already testifies to the truth of this contention. For astronomers called an 
equation of this type a “secular” equation (from the Latin saeculum 
= generation, saeculum civile = a period of 100 years) as it enabled them 
to determine “secular” (long-period) disturbances of planetary orbits, as 
regards eccentricities and inclinations about their mean values. 

It was with reference to astronomical problems that Lagrange, 53 a few 
years after Euler’s first discussion 54 of this subject, investigated the ques¬ 
tion of transforming bilinear forms of n variables to sums of squares 
(transformations to principal axes) and solved it for the case n = 2 and 
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n = 3. In a paper presented to the Berlin Academy in 1773 Lagrange 55 
showed that all roots of the secular equation for a symmetric form of three 
variables are real. His interest in these problems as well as Laplace’s 56 was 
motivated by the discovery that imaginary roots would have led to func¬ 
tions which, increasing exponentially with time, are incompatible with the 
assumed stability of the planetary system. 57 Lagrange and Laplace even 
thought, erroneously as Routh 58 later showed, that the existence of multi¬ 
ple roots is likewise inconsistent with this assumption. 59 The secular equa¬ 
tion studied by Lagrange and Laplace was of the sixth degree, only six 
planets being known at that time. After William HerschePs discovery of 
Uranus in 1781 numerical methods of solving secular equations of the 
seventh degree were studied by astronomers and mathematicians, as, for 
example, by Jacobi 60 in 1846. In the same year Leverrier and Adams 
discovered Neptune, and secular equations of the eighth degree became 
the subject of research. 61 Meanwhile, Lagrange’s result concerning the 
reality of eigenvalues 62 of symmetric bilinear forms for n = 3 was general¬ 
ized in 1829, again in connection with astronomical considerations, for an 
arbitrary finite n by Cauchy 63 and was finally extended to Hermitian 
forms in 1855 by Charles Hermite. 64 Needless to say, simultaneously with 
these developments the study of transformations to principal axes en¬ 
gaged the attention also of geometers and algebraists. 65 Our insistence, 
however, on the importance of astronomical calculations of orbits for the 
development of the mathematics of matrix mechanics gains additional 
support as soon as we consider the extension to infinite systems of linear 
equations with infinitely many variables. For, although Kotteritzsch 66 
was probably the first to try such an extension, the real development of 
this process, which proved to be of extreme importance for the later elabo¬ 
ration of matrix mechanics and quantum mechanics in general, had its 
origin, again, in astronomical computations of orbital motions. It was 
Hill’s paper on the motion of the lunar perigee which initiated the system¬ 
atic study of infinitely many equations with infinitely many variables. 
Hill, 67 treating such systems as if they were finite, introduced infinite 
determinants and applied to them the rules valid for finite determinants. 
His results agreed with observation, but the legitimacy of his mathemat¬ 
ical procedures remained an open question. Recognizing the importance 
of Hill’s paper as well as its analytical deficiencies, Poincare 68 attempted 
to put it on logically firm foundations by constructing a theory of determi¬ 
nants of infinite order in a mathematically rigorous way. Poincare’s work, 
in turn, inspired Helge von Koch 69 to study the use of infinite determi¬ 
nants in the theory of linear differential equations. Soon it became evident 
that such extensions—and in particular generalizations, to infinitely 
many variables, of orthogonal transformations of quadratic forms to prin- 
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cipal axes—are important not only for the solution of differential equa¬ 
tions but also for the theory of integral equations. In fact, it was one of 
Hilbert’s greatest achievements to have recognized that eigenvalues of 
quadratic forms and eigenvalues occurring in boundary problems of dif¬ 
ferential and integral equations have a common origin. 

In his famous work on the foundations of a general theory of integral 

equations 70 he considered bounded quadratic forms of infinitely many 

00 

variables which he defined as follows: 71 the quadratic form £ a kl X k X l i S 

kj~L i 

bounded if there exists a positive constant C such that for every set of 


variables jc„ 
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satisfying 



and for every 
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<C. These forms, now, if subjected to an orthogonal 


i k,i= i i 

transformation to principal axes, give rise, as he showed, in addition to a 
series of denumerably many quadratic terms, also to certain integrals; that 
is to say, their eigenvalue spectrum contains, in addition to discrete values 
or a “point spectrum,” also a continuum of terms or a “line spectrum.” 72 

A theory of bounded quadratic forms of infinitely many variables was 
also constructed by Hellinger 73 independently of Hilbert’s theory and 
without the need of Hilbert’s limit transitions from algebra to analysis 
which involved complicated convergence considerations. Hellinger’s re¬ 


sult that for such systems the bilinear Hermitian form ^ H mn x m x* can be 
transformed to the expression 


+ J (5.39) 

where the discrete real numbers W n form the point spectrum and the real¬ 
valued functions W(<p) of the continuous parameter cp the line spectrum, 
was assumed by Bom and his group to be valid also for systems which are 
not bounded. It enabled them to derive the continuous spectrum from one 
and the same theory and to account, as, for example, in the case of the 
hydrogen spectrum, for four kinds of transitions, transitions from ellipse 
to ellipse, from ellipse to hyperbola, from hyperbola to ellipse, and from 
hyperbola to hyperbola, on the basis of a logically consistent approach. 

Let us conclude our analysis of the rise of matrix mechanics, the earli¬ 
est version of quantum mechanics, with some remarks on the relationship 
between classical mechanics and quantum mechanics. One may rightfully 
wonder how the formalism or language of quantum mechanics, which, as 
we shall see in great detail later on, differs from the conceptual point of 
view most radically from that of classical mechanics, seems nevertheless 
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to include the latter as a special case, as the correspondence principle in its 
familiar formulation 74 contends. As the present chapter has shown and as 
will become increasingly clear in the sequel, the evolution of quantum 
mechanics out of classical physics was not a logically continuous process. 
However inconspicuous the change at the critical point may appear, even 
if it is only a transposition of indices, 75 it constitutes a logical hiatus of far- 
reaching consequences. In fact, there is simply no ultimate logical connec¬ 
tion between classical and quantum mechanics—“any more than there is 
one between a sense-datum language and a material object language.” 76 
How, then, can quantum theory be reduced by a logically continuous 
inference to classical mechanics as a limiting case, as the well-known 
extrapolation 77 of the Balmer formula for large quantum numbers seems 
to exemplify? 

To clarify the situation it is advisable to distinguish between two princi¬ 
ples of correspondence: one referring to the relationship between classical 
mechanics and the older quantum theory, and one relating classical me¬ 
chanics to modem quantum mechanics. As to the former—ignoring at 
present Bohr’s farsighted reservation—the popular interpretation of the 
principle as establishing a logical connection is not wholly unfounded 
since the older quantum theory was essentially but a modification of clas¬ 
sical physics. As to the correspondence principle in the latter significance, 
the situation is more complicated. As we have seen, the principle, deeply 
imbedded in the very foundations of the new formalism, played a decisive 
role in the establishment of matrix mechanics. And yet, Heisenberg’s pos¬ 
tulated indicial modification, inconspicuous as it was, and the Bom-Jor- 
dan commutation relations 78 led, so to speak, to a complete logical rup¬ 
ture with the classical scheme of conceptions. As a consequence, the 
correspondence principle, while leading to numerical agreements between 
quantum mechanical and classical deductions, affirmed no longer a con¬ 
ceptual convergence of the results but established merely a formal, sym¬ 
bolic analogy between conclusions derived within the context of two dis¬ 
parate and mutually irreducible theories. It only showed that under 
certain conditions (for instance, for high quantum numbers or, in classi¬ 
cal terms, for great distances from the nucleus) the formal treatments in 
both theories converge to notationally identical expressions (and numeri¬ 
cally equal results) 79 even though the symbols, corresponding to each 
other, differ strikingly in their conceptual contents. 

But one final comment: Poincare once emphasized that the form of 
classical mechanics “is due to the influence of celestial mechanics.” 80 In 
the present chapter it has been shown that the mathematics of quantum 
mechanics is also greatly indebted to the methods that have been used in 
solving problems of celestial dynamics. 
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5.2 Modifications of Matrix Mechanics 

Bom’s mathematically illegitimate assumption concerning the use of al¬ 
gebraic procedures for unbounded matrices led, as we have seen, in a 
natural way to full agreement with experience. To put matrix mechanics 
on sound foundations confronted mathematics with new problems, just as 
in the case of Hill’s work on the lunar perigee a few decades earlier. In fact, 
“a new era of mutual stimulation of mechanics and mathematics” was 
ushered in, just as Bohr predicted. In December, 1925, Heisenberg wrote a 
detailed exposition of the principles of matrix mechanics for the Mathe- 
matische Annalen , 81 the leading German journal for research in pure 
mathematics at that time. As a result, many mathematicians became in¬ 
terested in matrix mechanics. Thus, to mention only one example, Aurel 
Wintner conducted during the winter semester 1927/28 at the University 
of Leipzig, where Heisenberg had just been appointed professor of theo¬ 
retical physics, a seminar on the spectral theory of infinite matrices with 
special consideration of Hermitian unbounded matrices. In spite of nu¬ 
merous important contributions the seminar did not produce an exhaus¬ 
tive solution of the problem and, as Wintner had to admit, “a complete 
and mathematically satisfactory treatment of quantum-theoretic matrices 
continues to be a desideratum.” 82 

The ushering in of the new era of cooperation between physics and 
mathematics found its first tangible expression, so to speak, in the colla¬ 
boration between Max Bom and Norbert Wiener at the Massachusetts 
Institute of Technology. They had met for the first time when Wiener was 
visiting Gottingen in 1924 and gave a talk on his work on general harmon¬ 
ic analysis, which was very well received. Courant, the head of the depart¬ 
ment of mathematics—Klein had already retired because of ill health— 
arranged with Wiener some kind of exchange program according to 
which Wiener should return to Gottingen in 1926. Meanwhile Bom was 
invited as “Foreign Lecturer” for the fall term 1925/26 at M.I.T. Thus, 
Bom left Gottingen at the end of October, 1925, a few days after the 
completion of the “Drei-Manner-Arbeit,” to join Wiener. Bom realized 
that matrix mechanics, as formulated so far, was incapable of treating 
aperiodic phenomena such as uniform rectilinear motion, for in such a 
case no periods are present and the coordinate matrix q has no off-diag¬ 
onal elements at all. Bom and Wiener, therefore, decided to search for a 
generalization of matrix mechanics which would embrace periodic as well 
as aperiodic phenomena. 

Now, a few months before Bom’s arrival in Cambridge, Wiener had 
written a comprehensive article 83 on the operational calculus in which he 
discussed the contrast between the rigorous conception of operators, such 
as Volterra’s integral transformations (integral operators) or Pincherle’s 



Formation of Quantum Mechanics 


229 


transformations of one power series into another, and the heuristic ap¬ 
proach, “devoid even of the pretense of mathematical rigor,” such as 
Heaviside’s method. The main topic of the paper, however, was Wiener’s 
proposed generalization of the Fourier integral which would make it pos¬ 
sible to apply integral operators rigorously also to nonanalytic functions. 

In his autobiography Wiener gave the following account. “When Pro¬ 
fessor Bom came to the United States he was enormously excited about 
the new basis Heisenberg had just given for the quantum theory of the 
atom. Bom wanted a theory which would generalize these matrices.... 
The job was a highly technical one, and he counted on me for aid.... I had 
the generalization of matrices already at hand in the form of what is 
known as operators. Bom had a good many qualms about the soundness 
of my method and kept wondering if Hilbert would approve of my math¬ 
ematics. Hilbert did, in fact, approve of it, and operators have since re¬ 
mained an essential part of quantum theory.” 84 

Let us now give a brief description of how Bom and Wiener introduced 

operators into quantum theory. If y(t) = ^ y m exp( 2mW m t/h) and, 

m 

similarly, x(0 = ^x n exp( 2mW n t/h), it was easy to see that 

n 

1 C , s - 2 mW n s J 

x n = lim - x(s)ex p- as 

t= oo 2 T J — t h 

and, hence, that the matrix transformation y m = ^q mn x n entails 

n 

y(t) = lim —— f q(t,s)x(s)ds (5.40) 

T= oo 2 T J-T 

where q(t,s) = ^ q mn exp[27n( W m t — W n s)/h ]. Equation (5.40) was 

mn 

now interpreted as follows. The “operator” 

1 r 

q= lim - dsq(t,s)... (5.41) 

T= OO 2 T J — T 

transforms x(0 into>>(£) or, symbolically, 

y(t) = qx(t) (5.42) 

Generalizing Eq. (5.42), Bom and Wiener defined operators as follows: 85 
“An operator is a rule in accordance with which we may obtain from a 
function x(t) another function y(t ).... It is linear if 


q(x(t) + jKO) =q*(t) +qy(t)” 


(5.43) 
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For the linear operator D = d /dt , 

y(t) =Dqx(t) = lim — f x(s)ds 

T = oo 2T J — t dt 

which shows that Dq conforms to (5.41) if only q(t,s) in (5.41) is re¬ 
placed by the generating function 


dq ( t,s ) 2m ^ 

s — T w - “ p 


mn 


2m 


(W m t-W n (s) 


The matrix associated with Dq, just as ( q mn ) was with q, is, therefore, 

{Dq) mn = 0^ W m q mn ) =?yWq 


where W — ( W n S mn ). Similarly the matrix associated with the operator 
qD , as a partial integration shows, is 

(qD) mn =^j-W n q mn ' S j = ^-qW 

The matrix associated with the operator Dq — qD has therefore the ele¬ 
ments 


2 m 

—(W —W) a =2mv a 

, V rr m vw n 'Hmn x,//1 * m n Hmn 

h 

since hv mn = W m - W„. 

Bom and Wiener consequently defined the time derivative q of an oper- 


ator q by the operator equation 


ii 

i 

(5.44) 

In addition, the commutation relation 


h t 

M 9P= . . 1 

2m 

(5.45) 

as well as the canonical equations 


dH(pq) dH ( pq) 

9 dp P dq 

(5.46) 


were interpreted as operator equations for the Hermitian operators p and 
q, that is, operators whose associated matrices are Hermitian. 

On these assumptions they derived the energy theorem and proved that 
the energy operator H is identical with (h /2m)D. Finally, by solving the 
problems of the harmonic oscillator (iq + co\q = 0) and of the one-dimen¬ 
sional uniform motion (£ = 0), they demonstrated the applicability of 
their new operator mechanics to both periodic and aperiodic phenomena. 

It is interesting to note that, had Bom and Wiener supplemented their 
equations (5.44), (5.45), and 
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H = — D (5.47) 

2-iri 

by the equation 

h d 

p = - 

2m dq 

as a comparison of (5.44) with (5.45) could have suggested, they would 
have established an operator formulation of wave mechanics. Or as Bom 
declared: “We expressed the energy as d /dt and wrote the commutation 
law for energy and time as an identity by applying [t(d/dt) — ( d/dt)t ] to 
a function of t; it was absolutely the same as for q and p. But we did not see 
that. And I never will forgive myself, for if we had done this, we would 
have had the whole wave mechanics from quantum mechanics at once, a 
few months before Schrodinger.” 86 

Yet in spite of this oversight, their idea of representing physical quanti¬ 
ties, such as coordinates or momenta, by operators was a remarkable 
innovation, not only because it made phenomena, which so far had defied 
matrix mechanics, amenable to analytic treatment, but also because the 
operator formalism, as we shall see later on, provided the most lucid 
representation of quantum-theoretic relations and eventually led to a pro- 
founder understanding of the theory. Since the notion of operators will 
gain increasing importance in our subsequent discussions, a brief digres¬ 
sion on the development of this conception seems to be appropriate. 

Strictly speaking, the operator concept is as old as symbolic algebra, for 
every symbol in classical algebra denotes a precept either to substitute a 
number or to perform an arithmetic operation. If we define, however, the 
concept of an operator in a nontrivial manner as referring to the transfor¬ 
mation of a function to another function, as it was defined by Bom and 
Wiener, its origin can be traced back to the discovery of the differential 
calculus. In fact, it is often claimed that Leibniz 87 was the first to use 
operators in this sense as, for instance, when he stated that the modus 
operandi of the differential operator d n (xy) shows a striking analogy to 
the expansion of (x + y) n . Leibniz’s approach was further elaborated by 
Lagrange, 88 who generalized it to a “new kind of calculus concerning 
differentiation and integration.” In this calculus, which may be regarded 
as the earliest operator calculus in history, Lagrange employed d /dx sys¬ 
tematically as a fictitious quantity, subject to ordinary algebra as, for 
example, in the “Taylor expansion”/(x + h ) = exp [h (d /dx) ] f(x ). La¬ 
grange’s work was continued by A. M. Lorgna, J. P. Griison, L. F. A. 
Arbogast, M. J. F. Francais, S. F. Lacroix, and, especially, Francois- 
Joseph Servois, 89 who was the first to pay attention to the formal aspects of 
symbolic operations and introduced the notions of “commutativity” and- 



232 


Conceptual Development of Quantum Mechanics 


“associativity” of operators. The earliest detailed study of operator for¬ 
malisms in abstracto was probably Robert Murphy’s “First memoir on the 
theory of analytical operations,” 90 published in 1837. Distinguishing 
clearly between operand, operator, and result—in Eq. (5.42) these corre¬ 
spond to x(t), q , andjKO, respectively—Murphy stated: “The elements 
of which every distinct analytical process is composed are three, namely, 
first the Subject , that is, the symbol on which a certain notified operation is 
to be performed; secondly, the Operation itself, represented by its own 
symbol; thirdly, the Result , which may be connected with the former two 
by the algebraic sign of equality. Thus let a be the subject representing, we 
may suppose, some quantity, b the symbol for multiplication by b , and c 
the result or product; for greater distinctness let the subject be inclosed in 
square brackets, the analytical process in this case is [a]b = c. Again, let 
x n be the subject, a symbol of operation denoting that x must be changed 
into x + h f and (x + h) n will evidently be the result, or 
[x n ]tfr= ( x + A)".” Operators may commute or not, or as Murphy, ap¬ 
parently unaware of Servois’s terminology, expressed it, “operations 
are...relatively fixed or free\ in the first case a change in the order in which 
they are to be performed would affect the result, in the second case it 
would not do so.” Thus, to quote Murphy’s example, [x n ]xift ^ [ x n ]if/x , 
whereas [x n ]aift = [x n ] xfta, both being equal to ax + h n . 

The use of operators in pure mathematics for the solution of differential 
equations, a procedure familiar to every student of higher mathematics, 
characterizes the second stage in the development of the operator concep¬ 
tion. This phase was initiated by George Boole’s paper “On a general 
method in analysis,” 91 for which its author was awarded a gold medal by 
the Royal Society; it was given a detailed exposition in Boole’s classic 
Treatise on Differential Equations. 91 Boole’s introductory remarks to his 
method in the Treatise are still interesting also from our modem point of 
view and deserve to be mentioned. “The question of the true value and 
proper place of symbolic methods,” wrote Boole, 93 “is undoubtedly of 
great importance. Their convenient simplicity—their condensed power— 
must ever constitute their first claim upon attention. I believe however 
that, in order to form a just estimate, we must consider them in another 
aspect, viz. as in some sort the visible manifestation of truths relating to 
the intimate and vital connexion of language with thought—truths of 
which it may be presumed that we do not yet see the entire scheme and 
connexion.” Later, under the heading “Forms purely symbolical,” he 
declared: 94 “In any system in which thought is expressed by symbols, the 
laws of combination of the symbols are determined from the study of the 
corresponding operations in thought. But it may be that the latter are 
subject to conditions of possibility as well as to laws when possible. And 
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thus it may be that two systems of symbols, differing in interpretation, 
may agree as to their formal laws whenever they both express operations 
possible in thought, while at the same time there may exist combinations 
which really represent thought in the one but do not in the other. For 
instance, there exist forms of the functional symbol/ for which we can 
attach a meaning to the expression f(m), but cannot directly attach a 
meaning to the symbol fid /dx). And the question arises: Does this differ¬ 
ence restrict our freedom in the use of that principle which permits us to 
treat expressions of the form fid /dx) as if d /dx were a symbol of quanti¬ 
ty? For instance, we can attach no direct meaning to the expression 
exp [h(d/dx) ] fix), but if we develop the exponential as if d/dx were 
quantitative, we have 

exp [h-^-\f(x) = (\ + h^ + -^—h 2 -^ + etc. > \fix) =fix + h) 

\ dx) \ dx 1 *2 dx J 

by Taylor’s theorem. Are we then permitted, on the above principle, to 
make use of symbolic language; always supposing that we can, by the 
continued application of the same principle, obtain a final result of inter¬ 
pretable form? Now all special instances point to the conclusion that this 
is permissible, and seem to indicate, as a general principle, that the mere 
processes of symbolical reasoning are independent of the conditions of 
their interpretation.” Boole’s principle of admitting symbolic reasoning, 
even if no immediate interpretation is in sight but has to be searched for, 
has proved extremely useful in the later development of quantum mechan¬ 
ics. The elaboration of symbolic techniques for the solution of differential 
equations—as, for example, by Hargreave, Gregory, Brouwin, Carmi¬ 
chael, Forsyth—became in the second half of the last century, especially 
among British mathematicians, a prolific subject of research. 

Of special interest from our point of view is a development which began 
when Charles James Hargreave presented a paper 95 in June, 1847, to the 
Royal Society in London on applications of the operator function 
q){d/dx) to the solution of differential equations, such as, for example, 
Riccati’s equation, in the course of which questions concerning the inter¬ 
change of symbolic operators began to play an important role. 

Hargreave’s work was soon generalized by Charles Graves, Erasmus 
Smith Professor of Mathematics at the University of Dublin from 1843 to 
1862 and Bishop of Limerick until his death in 1866. In a series of papers 96 
read before the Royal Irish Academy, Graves, a mathematician, theolo¬ 
gian, linguist, and archaeologist, developed an extensive and powerful 
operator formalism which, in one aspect at least, had an influence not only 
on Heaviside but even on Dirac 97 at an important point of his work. 
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Graves’s paper 98 “On the principles which regulate the interchange of 
symbols in certain symbolic equations” began as follows: “Let ir and p be 
two distributive symbols of operation, which combine according to the 
law expressed by the equation 

pir = irp + a 

a being a constant, or at least a symbol of distributive operation commuta¬ 
tive with both ir and p” Graves then showed that in any symbolic equa¬ 
tion <p ( 7 r,p) = 0, ir may be exchanged with p and p with — ir, yielding the 
“correlative equation” cp{p, — ir) = 0. He also proved by induction that 
pif 1 = ifp + nif 1 ~~ 1 and pifm = xfmp + xfj'i r, where x/m denotes any func¬ 
tion of integral powers of ir. It will have been recognized that Graves’s 
reasoning and formal results recurred in the quantum-mechanical treat¬ 
ment of noncommutative quantities as, for example, in Dirac’s ^-number 
algorithm and its handling of the quantum conditions. 

The third stage in the development of the operator conception may be 
characterized by the application of operators to problems in applied math¬ 
ematics and, in particular, to the analysis of electric networks. The most 
important—and provocative—contribution was Oliver Heaviside’s appli¬ 
cation of operators to electromagnetic problems. To analyze networks of 
linear conductors with inductances, capacities, and resistances, Heavi¬ 
side 99 replaced the differential operator D = d /dt by p and regarded the 
latter as subject to the ordinary laws of algebra. Applying this method also 
when partial differential equations were involved, he paid little attention 
to questions of mathematical rigor. “Mathematics,” he declared, 100 “is an 
experimental science, and definitions do not come first, but later on.” The 
widespread use of the delta function prior to its vindication by the theory 
of distributions shows that also the development of quantum mechanics 
was not exempt from Heaviside’s antipuristic attitude. 

The next and last stage in the development of the operator conception, 
before its use in quantum mechanics, is characterized by the effort to 
justify the previous methods by rigorous proofs, either by making use of 
the theory of functions of a complex variable, as, for example, in the work 
of T. J. I’A. Bromwich, or, more generally, by the use of integral trans¬ 
forms, on the historical development of which the reader is referred to S. 
Pincherle’s excellent summary in the Encyklopddie der mathematichen 
Wissenschaften. 101 It was in this connection that Wiener became interest¬ 
ed in the operator calculus and introduced, together with Bom, this pow¬ 
erful mathematical tool into quantum mechanics. 

In concluding our brief survey on the early development of operators in 
mathematical physics, we should like to draw the reader’s attention to a 
rather unknown paper by Ludwik Silberstein, 102 which anticipated to 
some extent the formal aspects of the operational approach in modem 
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quantum mechanics. The study of symbolic integrals of the equations of 
the electromagnetic field 103 suggested to Silberstein, who is known mostly 
for his writings on the theory of relativity, a theory of “physical opera¬ 
tors,” in terms of which he attempted to give a unified representation of 
such disparate phenomena as mechanical oscillations, heat conduction, 
and electrodynamic processes. Defining the “state” (“Zustand”) of a 
physical system by a time-dependent function ifr(t), Silberstein intro¬ 


duced what he called “chrono-operators” 


H 

. t. 


by means of which the 


state at time t = t can be determined from the knowledge of the state at 


time t = 0, in accordance with the equation ip{t) = 


the inverse operator 
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and that 


H 
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{( e tF }, where F is another operator, connected with H in a definite 


manner, but independent of t. Silberstein even spoke of the superposition 
of states and mentioned numerous other details which, including the nota¬ 
tion, made their appearance in the operational formalism of quantum 
mechanics some twenty-five years later. “Every class of physical phenom¬ 
ena,” declared Silberstein 104 in 1901, “or at least those amenable to quan¬ 
titative treatment, will be characterized by corresponding physical opera¬ 
tors; and the scientific study of natural phenomena will proceed by 
detailed examinations of the properties of these operators, based on obser¬ 
vation and nurtured by experiment.” 

Whereas in Cambridge, Massachusetts, matrix mechanics was general¬ 
ized into an operator calculus by Bom and Wiener, in Cambridge, Eng¬ 
land, it was modified into an algebraic algorithm by Paul Adrien Maurice 
Dirac. Trained as an electrical engineer—Dirac obtained a B.Sc. in elec¬ 
trical engineering from the University of Bristol in 1921—but unable to 
find a job in the years of depression, Dirac continued his studies, primarily 
under the mathematician Peter Fraser, in Bristol until he obtained in 1923 
a senior research scholarship for three years at St. John’s College in Cam¬ 
bridge. Thus, during the years 1923 to 1926, when he obtained his Ph.D., 
he worked under R. H. Fowler and published a series of papers on the 
relativistic dynamics of particles, the Doppler principle in Bohr’s theory, 
the adiabatic principle, and other problems in the older quantum theo¬ 
ry. 105 In the course of this work he became thoroughly acquainted with 
the Hamilton-Jacobi formulation of classical mechanics and the use of 
uniformizing variables (angle and action variables), primarily through 
Whittaker’s Analytical Dynamics , the standard treatise on this subject in 
England. 
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Heisenberg, it will be recalled, shortly after his return from Heligoland, 
had left Gottingen for Cambridge in July, 1925, to lecture at the Caven¬ 
dish Laboratory. One of his lectures, entitled “On Term-zoology and Zee- 
man-botany,” 106 delivered on July 28, was sponsored by the Kapitza 
Club, a weekly colloquium, named after Peter Leonidovich Kapitza, who 
at that time was the assistant director of magnetic research at the Caven¬ 
dish. In all his Cambridge lectures, whether in his public lectures at the 
laboratory or in his colloquium talk at the club, Heisenberg spoke only 
about the older quantum theory and did not even refer to his new discover¬ 
ies, probably because he was not yet assured of the correctness of his new 
approach. Dirac, in fact, became acquainted with the new mechanics not, 
as it is so often alleged, through Heisenberg’s Cambridge lectures at the 
end of July, but only early in September, several weeks after Heisenberg 
had left Cambridge. For at that time Fowler obtained from Bohr the 
proofs of Heisenberg’s paper 107 and showed them to Dirac. At first, Dirac 
“saw nothing useful in it,” but after about two weeks he “saw that it 
provided the key to the problem of quantum mechanics.” 108 

Yet, convinced of the indispensability of Hamilton’s mechanics for the 
study of atomic physics, Dirac was not satisfied with Heisenberg’s exposi¬ 
tion and tried to adapt it to the Hamiltonian formalism. In a few weeks’ 
time he achieved his objective and thus established one of the most pro¬ 
found and useful relations between quantum mechanics and the classical 
Hamilton-Jacobi formulation of mechanics. He obtained this result by 
recasting Heisenberg’s mechanics into an algebraic algorithm on the basis 
of which he expected to derive all the formulas of the quantum theory 
without any explicit use of the Heisenberg products (matrices). Fowler 
immediately recognized the importance of this work and urged Dirac to 
publish his results, incomplete as they still were. Thus, Dirac’s paper “The 
fundamental equations of quantum mechanics,” completed on November 
7, 1925, appeared in the December issue of the Proceedings of the Royal 
Society 109 and marked the beginning of his celebrated contributions to 
quantum mechanics. 

Starting from Heisenberg’s representation of quantum-mechanical 
quantities by means of amplitudes x nm and frequencies v nm , subject to the 
multiplication rule 


{xy) nm =^x nk y km (5.48) 

k 

Dirac determined the form of the most general quantum operation d /dv, 
which satisfies 
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(5.50) 


From (5.50) he concluded that the amplitudes of the components of 
dx/dv are linear functions of those of x or 
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(5.51) 


Substitution of (5.51) in (5.50) yielded 
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This equation being valid for all values of the amplitudes, Dirac compared 
the coefficients of x n . k y k . m . on either side and obtained 


< 5 kk' a nm;n'm' ~ &mm' a nk' ;ri k + ^nn' a km;k 
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(5.53) 


Taking k — k\ he showed that a nm . n > m . vanishes unless n = ri or m = m' 
(or both); for k = k ', m = m\ n^ri he obtained a nm . n > m = a nhn . ki which 
showed that a nm . n > m for n^ri is independent of m; similarly, a nm . nm , for 
m^rri was shown to be independent of n. For k^k\m = m’> n = ri he 
obtained a nk , nk + a km . Xm = 0. Thus, for k a nk ,. nk =a kk - 

= ^km;k‘m * Finally, with k = k', m = rri, n = ri, a nm;nm = a nk . nk 
+ a km ;km = <*mm - a nn and Eq. (5.51) reduced to 
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or 


dx 

dv 


= xa — ax 


(5.54) 


which showed that the most general operation on a “quantum variable” x , 
subject to (5.49) and (5.50), is performed by “taking the difference of its 
Heisenberg products with some other quantum variable.” As the use of 
the term “Heisenberg product” clearly indicates, Dirac did not yet realize 
that Heisenberg’s theory was a matrix formalism, as Bom and Jordan had 
shown in their paper, 110 completed in September. 111 

Dirac now investigated to what the quantum-mechanical quantity 
xy — yx corresponds in the classical theory. Classically, 112 x = x(J,w) = 

^x T (J) exp {2mrw) t y=y(J,w) = ^y r (J)exp(2mrw) f where J and w 

T T 

are action and angle variables, and x T (J) =x T (nh) corresponds, in ac¬ 
cordance with Bohr’s correspondence principle, to x n n _ r . Hence 
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and similarly for x T . Hence, Dirac concluded, the nm component of the 
quantum-mechanical expression xy — yx corresponds to 

d d 

—[x T exp(2mrw) ]— [y a exp(2iricnv) ] 

oJ dw 


2m T +a=n- 


d d 
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dJ dw 
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or 


where 


xy — yx ++ 


ih 

2 ir 


(x,y) 


(5.56) 


(x,y) = 


dx dy 
dw dJ 


dy dx 
dw dJ 


(5.57) 


is the classical Poisson bracket expression for x and y , which is known to 
be invariant under a canonical transformation. Since for classical canoni¬ 
cal variables p cl , q cl ( q cl ,p cl ) = 1, (q cl ,q cl ) = (p cl ,p cl ) = 0, their corre¬ 
sponding quantum-mechanical variables p>q satisfy 


pq-qp = 


h 

2m 


(5.58) 


which is the Bom-Jordan commutation relation (5.25), page 220. In ac¬ 
cordance with (5.56) Dirac could take over into quantum mechanics the 
whole of classical mechanics as far as it could be formulated in terms of 
Poisson brackets instead of derivatives. In particular, it followed from 
(5.56) that for any function f(p,q) of the quantum-mechanical variables p 
and q the equation of motion is 


df _ 2ir 
dt ih 


( fH-Hf) 


(5.59) 


where H = H(pq) is the Hamiltonian of the system. 
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Dirac could easily generalize Eq. (5.58) to the “quantum conditions” 
for multiply periodic systems of / degrees of freedom—in fact, in his pa¬ 
per he expounded his theory for this more general case from the very 
beginning—and obtained for r, s = 1,2,...,/ 


QrQs ~ <ls<lr = 0 

P rPs ~ PsP r = 0 


Pr^ls QsPr ^rs 


h 

2m 


(5.60) 


Applying (5.56) to the quantum-mechanical coordinate x and the 
Hamiltonian H , which represents the time-independent energy of the sys¬ 
tem and has, therefore, according to Heisenberg, nonvanishing ampli¬ 
tudes only for n = m, Dirac inferred that 


y rr _ FT X — 

^ nmmm nn’*' nm 


ih 

2iT 


^nm 


— hv x 

nm nm 


or 

hv =H —H 

* r nm nn mm 

which is “just Bohr’s relation connecting the frequencies with the energy 
differences.” 113 

It will have been understood that the decisive step in Dirac’s proof of 
the correlation (5.56) between quantum-mechanical variables and their 
classical prototypes was the use of Bohr’s correspondence principle. By 
assigning to (5.56) or rather to its equivalent, the quantum conditions 
(5.60), a postulatory status within the conceptual structure of the theory, 
Dirac absorbed, so to speak, the correspondence principle as an integral 
part into the very foundations of his theory and, like Heisenberg, disposed 
thereby of the necessity of resorting to Bohr’s principle each time a prob¬ 
lem had to be solved. 

The quantization of classical mechanics, if carried out in accordance 
with (5.56) or, equivalently, (5.60), is more convenient than in Heisen¬ 
berg’s matrix mechanics. Dirac owed this advantage of his theory obvi¬ 
ously to the utilization of the Poisson brackets, one of the most powerful 
analytical tools in classical dynamics. As the name indicates, these brack¬ 
ets had been introduced in mechanics by Simeon Denis Poisson in a pa¬ 
per 1 14 read at the Institute de France on October 16,1809. Working on the 
integration of the equations of motion in the perturbation theory of plan¬ 
etary orbits, Poisson realized the usefulness of these brackets for this pur¬ 
pose. Their remarkable significance, however, for theoretical mechanics 
in general was fully recognized only in 1842-1843 by Jacobi when he 
discussed these brackets, and Poisson’s theorem in terms of them, in his 
famous lectures 115 on dynamics at the University of Konigsberg. 
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As is well known, Poisson’s theorem, which Jacobi once called “la plus 
profonde decouverte de M. Poisson,” 116 can easily be proved by the use of 
what is known as the “Jacobi identity” 

((x,y),z) 4- (O',*),*) 4- (( z,x),y ) = 0 (5.61) 

The proof of this identity in classical mechanics, whether carried out by 
direct calculation or indirectly, 117 is more laborious than the proof of its 
quantum-mechanical analogue, as Dirac has pointed out. 118 

In a second paper, 119 dealing primarily with the applicability of the 
algebraic formalism to specific problems, Dirac called the quantum- 
mechanical variables “^-numbers,” 120 in contradistinction to the numbers 
of classical physics, which satisfy the commutative law and which he 
called “c-numbers.” “At present,” Dirac declared, “one can form no pic¬ 
ture of what a ^-number is like. One cannot say that one ^-number is 
greater or less than another.” 121 “In order to be able to get results compar¬ 
able with experiment from our theory, we must have some way of repre¬ 
senting ^-numbers by means of c-numbers, so that we can compare these 
c-numbers with experimental values. The representation must satisfy the 
condition that one can calculate the c-numbers that represent x 4- y, xy, 
and yx when one is given the c-numbers that represent x and y. If a q- 
number is a function of the co-ordinates and momenta of a multiply peri¬ 
odic system, and if it is itself multiply periodic, then it will be shown that 
the aggregate of all its values for all values of the action variables of the 
system can be represented by a set of harmonic components of the type 
x nm exp(2mv nm t), where x nm and v nm are c-numbers, each associated 
with two sets of values of the action variables denoted by the labels n and 
m, and t is the time, also a c-number. This representation was taken as 
defining a ^-number in the previous papers on the new theory [Dirac here 
referred to the Bom-Jordan and the Bom-Heisenberg-Jordan papers]. It 
seems preferable though to take the above algebraic laws and the general 
conditions [ (5.60) ] as defining the properties of ^-numbers, and to de¬ 
duce from them that a ^-number can be represented by c-numbers in this 
manner when it has the necessary periodic properties. A ^-number thus 
still has a meaning and can be used in the analysis when it is not multiply 
periodic, although there is at present no way of representing it by c- 
numbers.” 122 

As this quotation shows, Dirac already envisaged at this stage the idea 
of a postulatory account of quantum mechanics which leads to a concep¬ 
tually autonomous and logically consistent system, even if its foundations, 
underlying the postulates, are deeply rooted in classical dynamics and its 
ultimate interpretation, connecting the calculus with experience, requires 
likewise recourse to classical physics. In any case, in the summer of 1926 
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Dirac completed a postulatory exposition of the algebra of ^-numbers 
which he published in a paper 123 entitled “The quantum algebra.” 

Dirac also admitted, as we see, that his ^-number algorithm—in con¬ 
trast to the Bom-Wiener operator calculus—could not yet account for 
nonperiodic phenomena. On the other hand, as if to compensate for this 
deficiency, Dirac derived on his theory the frequencies of the hydrogen 
atom, a result which Bom and Wiener were unable to obtain at that time. 

It should be noted, however, that at the same time also Pauli, 124 on the 
basis of matrix mechanics, arrived at the solution of the hydrogen problem 
and, in addition, gave a matrix-mechanical derivation of the Stark effect. 
Finally, Pauli showed in the same paper how the new matrix mechanics 
could easily solve the problem of crossed electric and magnetic fields (that 
is, the problem of calculating the energy perturbations in the hydrogen 
spectrum if the atom is acted on by such fields), a problem which gave rise 
to insurmountable difficulties if subjected to the treatment of the older 
quantum theory. 125 As a matter of fact, Pauli completed his matrix-me¬ 
chanical calculation of the Balmer levels in October, a week or two before 
Dirac began to work on his second paper. On November 3, Heisenberg 
wrote to Pauli, after having received from him a draft of his computations: 
“I need not tell you how delighted I am about the new theory of hydrogen 
and how pleasantly surprised I am about the speed with which you pro¬ 
duced this theory.” 126 About Pauli’s successful treatment of the problem 
of crossed fields Heisenberg should have rejoiced even more. For if Pauli’s 
matrix-mechanical derivation of the hydrogen spectrum showed that Hei¬ 
senberg’s theory is not inferior to the older quantum theory, Pauli’s solu¬ 
tion of the problem of crossed fields proved its superiority over the latter. 

It was, however, not the first piece of evidence for this superiority. For 
it will be recalled that according to Heisenberg’s matrix mechanics the 
energy of the harmonic oscillator in the lowest stationary state was not 
zero, as inferred by the older quantum theory, but \h v 0 . The notion of such 
zero-point energy, which, as we have seen, was proposed by Planck’s so- 
called “second theory” 127 of 1911 and soon rejected, was nonetheless nev¬ 
er completely obliterated. In fact, there were a number of experimental 
findings, as, for example, those cited by Bennewitz and Simon, 128 which 
seemed to support the idea. Finally, at the end of 1924, more than half a 
year before the publication of Heisenberg’s historic paper, the existence of 
the zero-point energy was irrefutably established. In his study of the iso¬ 
tope effect in the vibration band spectrum of boron monoxide Mulliken 129 
observed that the shift corresponding to the transition S 2 = 0 to S x = 0 is 
given by 130 \h [ (r 2 — r 2 ) — ( v x — v \) ] a result which could not be ac¬ 
counted for without the assumption of the zero-point energy. Mulliken’s 
observation thus conflicted with the older quantum theory but was in full 
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agreement with the subsequently proposed matrix mechanics. It was the 
earliest evidence of the superiority of quantum mechanics over the older 
theory. 


5.3. The Rise of Wave Mechanics 

Simultaneously with the rise of matrix mechanics another, apparently 
unrelated, conceptual development took place, whose point of departure 
was not the mechanics of particles but rather the problem of the nature of 
light, which had challenged man’s thirst for knowledge ever since he be¬ 
gan to reflect on the physical world. For light, although the principal 
agent of our knowledge of the external world, never revealed its own 
identity and, although the intermediary between matter and the finest of 
man’s senses, never removed the veil of its mystery. 

Classical physics, as is well known, offered two major hypotheses on 
the nature of light: Newton’s corpuscular doctrine and Huygens’s undula- 
tory theory. The first to attempt a reconciliation of these two was Sir 
William Rowan Hamilton, an ardent admirer of Lagrange. At a time 
when the wave theory of light, primarily through the work of Fresnel, 
achieved its most spectacular successes, Hamilton proposed to give to 
optics, as a formal science, the same “beauty, power and harmony” which 
Lagrange gave to mechanics. This he tried to achieve by finding one single 
law of nature which governs both the propagation of light and the motion 
of particles. 131 With the help of his “characteristic function” he discov¬ 
ered that the actual motion of a point mass in a field of force is governed by 
the same formal law as the propagation of the rays of light. 

In fact, it is often forgotten that Hamilton’s famous formulation of 
mechanics originated in his investigations on optics. Modifying his pre¬ 
sentation somewhat for future reference and using modem notation, 132 we 
may summarize his results as follows. 

From Hamilton’s variational principle 5fL dt = 0 it is easy to derive, 
for constant energy E , Maupertuis’s principle of least action, the oldest 
variational principle of dynamics, 8$2Tdt = 0 or 

8\[2m(E-U)Y l2 ds = 0 (5.62) 

Comparing (5.62) with Fermat’s principle of minimum time, the basis of 
geometrical optics, 




0 


( 5 . 63 ) 
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we see that the expression C[2m(E — U) ] 1/2 plays in mechanics the 
same role as the phase velocity u = c/n in optics, or symbolically, 

u = -?-++C[2m(E-U)]~ in (5.64) 

n 

a relation which may be called “Hamilton’s optical-mechanical analogy.” 
This analogy, however, can be extended still further. The action function 
S = S[ o L dt defines in configuration space an “action surface” S(x,y,z 9 t) 
= const for which, as is well known,/? = VS and dS /dt = L — pv= — E. 
On the other hand, for plane monochromatic waves the wave vector k and 
the frequency co = 2ttv are clearly related to the phase cp = —cot + kr by 
the equation k = Vcp and dcp /dt = — co. Hence, the surface of constant 
action of a system of particles is propagated in complete analogy to the 
surface of constant phase in optics, the wave vector corresponding to the 

momentum, and the frequency to the energy of the particle. 

< 

These results, which Hamilton 133 had published between 1828 and 
1837, found little attention for almost a hundred years. Hamilton’s me¬ 
chanics, disjoined from his optics, was, as we know, 134 elaborated and 
reformulated in a most profound way by Jacobi. Hamilton’s optics, dis¬ 
joined from his mechanics, reappeared in Bruns’s theory of the eikonal. 135 
But the synthetic point of view, the insistence on a joined formalism of 
optics and mechanics, which was the chief concern of Hamilton, was 
almost completely ignored. A notable exception was Felix Klein. He re¬ 
peatedly stressed the importance of Hamilton’s approach, as, for example, 
in his lecture on November 22,1891, at the Convention of German Scien¬ 
tists in Halle, 136 and deplored the lack of interest in Hamilton’s ideas on 
the part of his contemporaries. Another exception was G. Prange, 137 who 
dedicated his “Habilitationsrede” in 1921 at the University of Halle to 
Hamilton’s contribution to geometrical optics and their relations to me¬ 
chanics. 

The indifference with which former generations viewed Hamilton’s 
conceptions can easily be explained. For, although the rays, determined 
by Fermat’s principle, do correspond to the trajectories, determined by 
Maupertuis’s principle, and the wave surfaces to those of Hamilton’s ac¬ 
tion, no correspondence could be established between the velocity of the 
particle and that of the ray. Hamilton’s formalism was therefore regarded 
as only a formal analogy, devoid of any deeper physical significance. 

According to Hamilton, Newtonian mechanics corresponds to geo¬ 
metrical optics. But the latter is only an approximation of undulatory 
optics. Thus, even if it had been realized at that time that the validity of 
Newtonian mechanics, in view of its correspondence with geometrical 
optics alone, has limitations and that a more exact mechanics ought to 



244 


Conceptual Development of Quantum Mechanics 


assume undulatory characteristics, the conceptual apparatus of nine¬ 
teenth-century science would not yet have been ready to accept the con¬ 
clusion that, just as rays had to be replaced by waves in optics, trajectories 
of particles had to be superseded by some other conception; for this would 
have necessarily led to the abrogation of the very notion of the particle 
itself. For these reasons, not theoreticians but experimentalists were the 
first to face the wave-particle problem. 

From Wilhelm Konrad Rontgen’s discovery of x-rays 138 until the fam¬ 
ous diffraction experiments performed on Laue’s suggestion by Friedrich 
and Knipping, 139 and even later, the question whether these rays are parti¬ 
cles or waves remained unsettled. Their emission from radioactive atoms, 
together with a and f3 rays, about whose corpuscular nature there seemed 
to be no doubt, their power of ionizing 140 a gas through which they pass, 
and other of their properties suggested a corpuscular nature. W. H. Bragg, 
even after he had shown that this ionization resulted only indirectly from 
the ejection of f3 rays or electrons, declared—anticipating an important 
feature of the Compton effect: 141 “Owe x-ray provides the energy for one 
beta-ray, and similarly in the x-ray bulb, one beta-ray excites one x-ray... 
the speed of the secondary beta-ray is independent of the distance that the 
x-ray has traveled: so the x-ray cannot diffuse its energy as it goes, that is 
to say, it is a corpuscle.” 142 In a paper written conjointly with Porter, 
Bragg stated: “Energy considerations lead us directly to the supposition 
that the x- and gamma-rays are corpuscular in nature in so far as each ray 
is a separate identity moving through space unaltered in form and energy- 
content, just as an unhindered projectile would do.” 143 

These and similar considerations concerning the properties of x-rays 
led to the paradoxical situation that even after the diffraction experiments 
had established in an unequivocal manner the undulatory character of 
these rays, the possibility of these being at the same time also something 
corpuscular was not wholly dismissed. In fact, reviewing the situation at 
the end of 1912, W. H. Bragg made this profound and prophetic state¬ 
ment: “The problem becomes, it seems to me, not to decide between two 
theories of x-rays, but to find, as I have said elsewhere, one theory which 
possesses the capacity of both.” 144 

Bragg’s view on the nature of x-rays was shared by a number of experi¬ 
mentalists—among them Maurice de Broglie, an elder brother of Louis de 
Broglie. He has already been mentioned as the editor of the Proceedings of 
the first Solvay Congress 145 and as a staunch supporter of Compton in the 
debate with Duane. 146 His laboratory in the rue Byron in Paris specialized 
in the study of x-rays. “My brother,” declared Prince Louis-Victor de 
Broglie, 147 “considered x-rays as a combination of wave and particle, but, 
not being a theoretician, he did not have particularly clear ideas on this 
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subject.” It was in this laboratory where Maurice initiated his brother to 
experimental research 148 and where he “insistently drew his attention to 
the importance and undeniable reality of the dual aspects of particle and 
wave.” 149 In his autobiographical notes Louis de Broglie 150 wrote: “I had 
long discussions with my brother on the interpretation of his beautiful 
experiments on the photoelectric effect and corpuscular spectra.... These 
long conversations with my brother about the properties of x-rays...led me 
to profound meditations on the need of always associating the aspect of 
waves with that of particles.” 

These speculations, however, were not the only factor which deter¬ 
mined Louis de Broglie’s direction of thought. It should be recalled that 
he began his academic studies in preparation for a career in the civil ser¬ 
vice at the Faculte des Lettres of the University of Paris and in 1910 
obtained his L. es Sc. (Licencie es Sciences) in history. However, partly as 
a result of his brother’s influence, partly through the impact of Poincare’s 
Value of Science and Science and Hypothesis , he became so fascinated with 
the problems of physics and its philosophy that he decided to change his 
career. In 1913 he obtained his L. es Sc., in the physical sciences at the 
Sorbonne. When he continued his formal studies later at the Faculte des 
Sciences of the Sorbonne toward a D. es Sc., he became particularly inter¬ 
ested in those branches of physics which dealt with the fundamental prob¬ 
lems of time, space, and the structure of matter and light. Thus, for exam¬ 
ple, Paul Langevin’s lectures on relativity and his analysis of the concept 
of time—Langevin was the first to treat in detail the so-called “clock 
paradox” in relativity—made a great impression on Louis de Broglie. In 
fact, his conception of phase waves was intimately related to his reflec¬ 
tions on the nature of time. “This difference between the relativistic varia¬ 
tions of the frequency of a clock and the frequency of a wave,” he wrote, 151 
“is fundamental; it had greatly attracted my attention, and thinking over 
this difference determined the whole trend of my research.” 

Before we analyze Louis de Broglie’s decisive contributions to the con¬ 
ceptual development of quantum mechanics and, in particular, his intro¬ 
duction of what is often called “matter waves,” 152 we shall ask ourselves 
whether—as it has so often occurred with great innovations in science— 
his ideas had not been anticipated, at least in part, by speculative thinkers 
in the past. Because of his optical-mechanical analogy Hamilton is often 
regarded as a direct precursor of de Broglie’s hypothesis. This, however, is 
only partially true. For Hamilton declared: “Whether we adopt the New¬ 
tonian or the Huyghensian, or any other physical theory, for the explana¬ 
tion of the laws that regulate the lines of luminous or visual communica¬ 
tion, we may regard these laws themselves, and the properties and 
relations of these linear paths of light, as an important separate study, and 
as constituting a separate science, called often mathematical optics.” 153 
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Hamilton’s interest, it is clear, lay wholly in the formal-mathematical 
aspects and not in the problem of the physical nature of light. 

There were in the nineteenth century, however, some unorthodox 
thinkers who maintained, whether on physical or on philosophical 
grounds, the ultimate identity between particles and waves. One of the 
most eloquent proponents of these ideas was Baron Nicolai Bellingshau¬ 
sen. After studying physics at the Universities of Dorpat, Leipzig, and 
Heidelberg, he returned to his Baltic estate and wrote a number of inter¬ 
esting studies on physics while administering his lands and farms. His 
major work, an enlarged version of an earlier essay, 154 was published in 
1872 under the title Grundzuge einer Vibrationstheorie derNatur (Foun¬ 
dations of a Vibration Theory of Nature). 155 Basing his ideas on the carte¬ 
sian conception of extension as the chief characteristic of matter 156 and on 
the kinetic theory of heat which reduced heat to motion, Bellingshausen 
identified atoms with standing waves and interpreted motion of particles 
as a vibrational process. According to his doctrine bodies are merely “ex¬ 
tended centers of vibrational motions” and the “nature of matter is exten¬ 
sion in motion.” 157 A similar theory, though more related to the math¬ 
ematical problem of the nature of space, was proposed at about the same 
time by William Kingdon Clifford, the translator of Riemann’s works into 
English. Seeing in Riemann’s conception of space the possibility for a 
fusion of physics with geometry, Clifford interpreted the motion of matter 
as a manifestation of changes in the curvature of space. 158 

In spite of certain similarities, it would be definitely a mistake to as¬ 
sume that de Broglie had been influenced by such speculations. He did not 
even know of their existence when he advanced his new ideas. Apart from 
those more general factors, which we have mentioned previously, there 
was, however, an additional, more specific, factor which seemed to have 
determined the direction of his thought. It was a series of papers on what 
we would today call a hydrodynamic model of a vibrating atom and was 
written by Marcel Brillouin, whom de Broglie once called “le veritable 
precurseur de la mecanique ondulatoire.” 159 In 1919 Brillouin, a professor 
of physics at the College de France, published a paper 160 in which he 
studied the periodic motion of a vibrating particle in an elastic medium. 
Assuming that the velocity of the orbiting particle exceeds many times 
that of the elastic waves in the medium, he showed that with each given 
position of the particle a finite number n of preceding positions can be 
associated in such a way that the particle at the given position (at given 
time) attains precisely those n elastic waves which originated from the 
particle when it was at the preceding positions. Considering the case of a 
constant n as a stationary state, Brillouin attempted to relate this model 
with Bohr’s atom and to account for its optical properties by an appropri- 
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ate choice of the constants at his disposal. 161 In a second paper 162 he gener¬ 
alized his derivation of discontinuous properties, characterized by inte¬ 
gers, and introduced a variable propagation velocity of the elastic waves. 
He suggested that an appropriate determination of the inverse value of 
this velocity, which value is a function of the distance r from the nucleus, 
and which he significantly denoted by «(r), could make it possible to 
derive, for example, the Balmer terms in Bohr’s theory of the hydrogen 
atom. In a third paper 163 he introduced a particular Lagrangian function 
for the immediate surroundings of the nucleus where the Hertz-Maxwell 
equations, as suggested by Bohr’s theory, cease to be valid. Referring to 
the two main suggestions he had made, he concluded this paper with the 
words: “But perhaps there exists a different, third road on which a young 
audacious investigator may achieve success.” 164 Louis de Broglie un¬ 
doubtedly was well acquainted with these papers. He even seemed to have 
received reprints of them from Brillouin himself. 165 

After these remarks we shall now analyze Louis de Broglie’s contribu¬ 
tions to the conceptual development of quantum mechanics. In his search 
for a “synthetic theory of radiation” 166 which combines the aspects of 
waves and of particles, he naturally turned at first to Einstein’s work on 
light quanta. Following Einstein, he treated black-body radiation as a gas 
of light quanta and showed in the earliest of his publications on this sub¬ 
ject that such a treatment, if subjected to classical statistical mechanics, 
leads to Wien’s distribution law. 167 When he showed this paper to Lange- 
vin, he received the answer: “Your ideas are interesting, but your gas has 
nothing to do with true light.” In a second memoir, 168 in which he tried to 
reconcile Einstein’s hypothesis of light quanta with the phenomena of 
interference and diffraction, he suggested for the first time the idea that it 
would be necessary to associate with those quanta a certain element of 
periodicity. 169 Searching for such a periodicity—one should realize that 
only later, of course, it became clear that if the particle is a photon, the 
associated frequency, which de Broglie was looking for, is precisely the 
radiation frequency r and that the ordinary light wave is precisely the de 
Broglie wave of the photon—he studied the analogies between the formal¬ 
ism of analytical mechanics and that of undulatory theories, analogies 
which had “strikingly impressed” his mind from his “early youth.” 170 
Finally, at the end of the summer of 1923, his conception of phase waves 
began to take shape. The paper 171 in which he proposed these ideas may 
rightfully be regarded as initiating the theory of wave mechanics. 

De Broglie considered the motion of a particle of rest mass m 0 and 
velocity v = fic with regard to a stationary observer and assumed that the 
particle is the seat of a periodic internal phenomenon whose frequency v 0 
is given by m 0 c 2 /h , where h is Planck’s constant and m 0 c 2 the internal 
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energy of the particle. Now, the stationary observer, de Broglie main¬ 
tained, ascribes to the particle the energy m 0 c 2 /( 1 — P 2 ) 112 and hence to 
the periodic phenomenon the frequency r = v 0 /(I — P 2 ) 112 . Looking on 
the internal phenomenon, however, the observer will see its frequency, 
owing to the relativistic time dilation, being lowered to v x = v 0 
( \ —P 2 ) 112 and its vibrations varying as sin 2i tv x t. It was this discrepancy 
between r and v x which so greatly attracted his attention and “determined 
the whole trend” 172 of his research. To resolve this dilemma, de Broglie 
introduced “a fictitious wave, associated with the motion of the mobile” 
(“une onde Active associee au mouvement du mobile”), which spreads 
with velocity c/P and frequency r as defined above. He then could prove 
that if at the beginning the internal phenomenon of the moving body is in 
phase with the wave, “this harmony of phase will always persist.” Since 
the body, which at time t = 0 coincided with the wave at the origin, is at 
time t = t at x = vt, its internal phenomenon is proportional to sin ( 2ttv x 
x/v), whereas the wave at that place is given by 


sin|^27rr^ — 2ttvx^ — — 

But by definition v x = v( 1 — P 2 ), which proves the theorem. This result, 
continued de Broglie, suggests “that any moving body may be accompa¬ 
nied by a wave and that it is impossible to disjoin motion of body and 
propagation of wave.” 173 

De Broglie then applied this result to the motion of an electron on a 
closed trajectory with period T\ assuming that the associated wave de¬ 
scribes the same trajectory with the velocity c/p. If at the time t = 0 the 
electron was at the origin O (of the trajectory), he supposed that the wave, 
starting from 0, reaches the electron at the time rat O ', where OO ' = (c/ 
/?)r.Thus,r = (P /c)[Pc(t + T)]orr= TP 2 /(\ — P 2 ),andhencethe 
internal phase of the electron changed by 



2irv,r = 277-^-- T ^\ = 2ttD 

h (1 — p 2 ) 112 

Claiming that it is almost self-evident that the motion is stable only if “the 
fictitious wave, passing through O ', finds the electron in phase with itself ” 
or that D = n (n is an integer), de Broglie demonstrated that the last 
equation is just the Sommerfeld quantum condition J= SoP dq = nh, 
since in the present case 


J= f - m \ - 7 - \?dt = Wp/? Y T — Dh 

Jo ( 1-/? 2 ) 1/2 (1 — P 2 )' 12 


In a second 174 and a third 175 memoir, published two and four weeks 
later, de Broglie demonstrated that the velocity v = Pc of the particle is 



Formation of Quantum Mechanics 


249 


precisely the group velocity 176 of the phase waves. 177 The statement that 
the particle follows in every point of its trajectory the ray of its phase wave, 
that is, the normal to the equiphase surface at that point, was now for him 
the basic postulate for the dynamics of free particles. 

Generalizing this result for the case of curved paths and variable veloc¬ 
ities in a field of force and retaining this postulate, de Broglie pointed out 
that the path can be computed “as in a medium of variable dispersion” by 
means of Fermat’s principle 



where A is the wavelength of the phase waves, V = c//3 their velocity of 
propagation, and v their frequency. Fermat’s principle can also be ex¬ 
pressed by the equation 

f-g ?pff c ds = s{ pds = 0 

J (1 — 0 2 ) 112 J p 

which is precisely Maupertuis’s variational principle. This conclusion, de 
Broglie emphasized, justifies the assumption of the above-mentioned pos¬ 
tulate. “The fundamental bond which unites the two great principles of 
geometrical optics and of dynamics is thus fully brought to light.” 178 

In general, the trajectory of the particle will thus be determined by the 
Fermat-Maupertuis principle. If, however, the particle has to pass 
through an aperture or slit of dimensions comparable with the wavelength 
of the phase waves, the particle’s path will be curved in accordance with 
the diffraction of the phase waves. Introducing the additional assump¬ 
tion 179 that the probability of the absorption or scattering of a light quan¬ 
tum by an atom is determined by the geometrical resultant of the phase- 
wave vectors crossing upon it—a hypothesis which according to de 
Broglie is “analogous to that which is admitted by electromagnetic theory 
when it links intensity of disclosed light with the intensity of the resultant 
electric vector”—he reconciled interference and diffraction with the hy¬ 
pothesis of light quanta. He explained, for example, Young’s experiment 
as follows: “Some atoms of light pass through the holes and diffract along 
the ray of the neighbouring part of their phase waves. In the space behind 
the wall, their capacity of photoelectric action will vary from point to 
point according to the interference state of the two phase waves which 
have crossed the two holes. We shall then see interference fringes, how¬ 
ever small may be the number of diffracted quanta, however feeble may be 
the incident light intensity. The light quanta do cross all the dark and 
bright fringes; only their ability to act on matter is constantly changing. 
This kind of explanation, which seems to remove at the same time the 
objections against light quanta and against the energy propagation 
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through dark fringes, may be generalized for all interference and diffrac¬ 
tion phenomena.” 180 

These conclusions, de Broglie continued, should apply to electrons as 
well: “a stream of electrons passing through a sufficiently narrow hole 
should also exhibit diffraction phenomena.” 181 “It is in this direction 
where one has probably to look for experimental confirmations of our 
ideas.” 182 

This synthesis of waves and particles (or quanta), de Broglie main¬ 
tained, could be obtained because the phase wave is conceived as “guiding 
the displacements of the energy.” 183 The wave theory, in his view, went 
too far by denying the discontinuous structure of radiative energy, and not 
far enough by renouncing its relevancy to mechanics. “The relation be¬ 
tween the new dynamics of free particles and the old dynamics is exactly 
that between wave optics and geometrical optics.” 184 He concluded with 
this statement: “Many of these ideas may be criticized and perhaps re¬ 
formed, but it seems that now little doubt should remain of the real exis¬ 
tence of light quanta. Moreover, if our opinions are received, as they are 
grounded on the relativity of time, all the enormous experimental evi¬ 
dence of the ‘quantum’ will turn in favour of Einstein’s conceptions.” 185 

On November 29, 1924, de Broglie presented his doctoral thesis “Re- 
cherches sur la Theorie des Quanta” 186 to the Faculty of Science at the 
University of Paris. Essentially a detailed elaboration of his preceding 
papers, it contained seven chapters: (1) the phase wave, (2) the principles 
of Maupertuis and of Fermat, (3) the quantum conditions of stability, (4) 
the quantification of simultaneous motions of two electrical centers, (5) 
the quanta of light, (6) the diffusion of x- and y rays, (7) statistical 
mechanics and the quanta. Besides, it was prefaced by a historical intro¬ 
duction of great interest for the historian of optics. Called by Darwin 
“truly one of the most important documents in the history of scientific 
research,” 187 it concluded with these words: “The definitions of the phase 
wave and of the periodic phenomenon were purposely left somewhat va¬ 
gue...so that the present theory may be considered a formal scheme whose 
physical content is not yet fully determined, rather than a fullfledged 
definite doctrine.” 188 Although the examining committee highly ap¬ 
praised the originality of the work, it did not believe in the physical reality 
of the newly proposed waves. When asked by Perrin whether these waves 
could be experimentally verified, de Broglie replied that this should be 
possible by diffraction experiments of electrons by crystals. In fact, de 
Broglie seems to have suggested just such an experiment to Alexandre 
Dauvillier, 189 one of his brother’s collaborators in the rue Byron, but Dau- 
villier was too busy with experimenting on television. 190 
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Perrin’s question, as we know today, could have been given an affirma¬ 
tive answer on the basis of experimental evidence available at the time, 
nay, even dating back to 1914, as we shall presently see—if not much 
earlier 191 

In the summer of 1924, Einstein received from Satyandra Nath Bose of 
Dacca University in India a letter which began with these words: “Re¬ 
spected Sir, I have ventured to send you the accompanying article for your 
perusal and opinion. I am anxious to know what you think of it. You will 
see that I have tried to deduce the coefficient 87 rrVc 3 in Planck’s law 
independent of classical electrodynamics, only assuming that the ultimate 
elementary region in the phase space has the content h 3 . I do not know 
sufficient German to translate the paper. If you think the paper worth 
publication I shall be grateful if you arrange for its publication in Zeit- 
schrift fur Physik. Though a complete stranger to you, I do not feel any 
hesitation in making such a request. Because we are all your pupils though 
profiting only by your teaching through your writings ...” 192 

The article referred to was, of course, Bose’s well-known derivation of 
Planck’s radiation law on the basis of what is now known as the Bose- 
Einstein statistics. Einstein, in fact, immediately recognized the impor¬ 
tance of the paper, translated it, and sent it to the editor of the Zeitschrift 
fur Physik} 93 

But Einstein did more. Applying Bose’s approach to the monatomic 
gas , 194 he realized that “a far-reaching formal relationship between radi¬ 
ation and gas” (“eine weitgehende formale Verwandschaft zwischen 
Strahlung und Gas”) can be established. For he showed that—if the gas 
particles are subjected to the new statistics—the mean-square energy fluc¬ 
tuation is given by an expression which, like Eq. (1.19) on page 28, is 
composed of two additive terms , 195 one corresponding to the Maxwell- 
Boltzmann statistics of noninteracting molecules and the other to the 
interference fluctuation associated with undulatory phenomena. 

Now, as we know from reliable sources, Langevin, in spite of the lack of 
any as yet recognized empirical verifications, spoke about de Broglie’s 
work at the fourth Solvay Congress 196 in April, 1924. He also informed 
Einstein about it . 197 Einstein, working at that time on the implications of 
Bose’s paper, became naturally very much interested in de Broglie’s thesis 
and asked 198 for a copy, which he received and read in December, 1924. 
As a result, in his second paper 199 on the quantum theory of the ideal gas, 
Einstein referred, in the context of his discussion on the interference term 
in the energy-fluctuation formula, to de Broglie’s work and declared:. “I 
shall discuss this interpretation in greater detail because I believe that it 
involves more than merely an analogy .” 200 
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After Einstein, now in turn, had drawn Bom’s attention 201 to de Brog¬ 
lie’s thesis, Bom discussed it with James Franck, who had headed the 
department of experimental physics at the University of Gottingen since 
1920. Bom’s student Walter Elsasser, who had come to Gottingen in 
1924, also attended the discussion. When Elsasser suggested the perfor¬ 
mance of a diffraction experiment with free electrons as a possible confir¬ 
mation of the existence of de Broglie waves, Franck commented that this 
“would not be necessary since Davisson’s experiments had already estab¬ 
lished the existence of the expected effect.” 202 

To understand Franck’s remarks, the following developments incident 
to Davisson’s work must be borne in mind. Clinton Joseph Davisson, 
interested in the secondary electron emission from electrodes in vacuum 
tubes, had been experimenting since 1919, mostly together with C. H. 
Kunsman, on the scattering of electrons from metals at the Research 
Laboratories of the American Telephone and Telegraph Company and 
Western Electric Company in New York. In their first publication on 
such experiments Davisson and Kunsman declared: “All the main fea¬ 
tures of the distribution curves so far observed for the scattering from 
nickel seem reasonably accounted for on the supposition that a small 
fraction of the bombarding electrons actually do penetrate one or more of 
the shells of electrons which are supposed to constitute the outer structure 
of the nickel atom and, after executing simple orbits in a discontinuous 
field, emerge without appreciable loss of energy.” 203 Interpreting the an¬ 
gular distribution of the scattered electrons as due to the variation of 
charge density in the atomic shells, Davisson proposed in 1923 a formu¬ 
la 204 for the confirmation of which he and Kunsman experimented on the 
scattering of low-speed electrons by platinum and magnesium targets. 205 

Now, when Franck studied the diagrams of the Davisson-Kunsman 
experiments, he realized that the peaks obtained could be interpreted as 
due to diffraction phenomena. In fact, Franck and Elsasser soon recog¬ 
nized that the maxima of the angular distributions of the reflected elec¬ 
trons move with increasing velocity of the incident beam toward the direc¬ 
tion of the primary beam and thus exhibit features similar to those 
connected with the well-known phenomena of x-ray diffraction by crys¬ 
tals. They explained therefore the “scattering” as a diffraction process and 
found that the wavelength involved agreed with the value obtained from 
de Broglie’s formula A = h /mv , where m is the mass and v the velocity of 
the electrons. 

Moreover, Elsasser also realized that Ramsauer’s strange experimental 
results 206 could be explained as an interference effect of the de Broglie 
waves—results which, by the way, were at about the same time indepen¬ 
dently obtained also by Townsend and Bailey 207 as well as by Chaud- 
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huri, 208 namely, that for slow electrons of energy values below 25 eV the 
scattering cross section of rare gases, particularly of argon, decreases as if 
the atom becomes completely transparent to the electron. Strictly speak¬ 
ing, the “Ramsauer effect” or “Ramsauer-Townsend effect,” as it was 
later called, had already been noticed in 1914 by A. Akesson, a former 
student of J. R. Rydberg and M. Siegbahn at the University of Lund. 
When working on his Ph.D. thesis 209 in Lenard’s laboratory in Heidel- 
berg, Akesson found that the effective scattering cross section of certain 
gases decreased for slow electrons in a way which was inconsistent with 
the kinetic theory. A few years later, however, H. F. Mayer claimed in his 
thesis, also at the University of Heidelberg, that he had disproved Akes- 
son’s results. 210 Eventually, it was Ramsauer who provided an incontesta¬ 
ble proof of the existence of the effect by employing what later became 
known as the “Ramsauer circular method” (“Ramsauer Kreismeth- 
ode”). 

Elsasser summarized these conclusions in a paper which he sent to 
Arnold Berliner, the editor of Die Naturwissenschaften. Doubtful whether 
to publish the paper, Berliner consulted von Laue and Pringsheim, with¬ 
out, however, obtaining any commitment on their part, and also Einstein, 
who endorsed its publication. 211 Thus Elsasser’s note, 212 concluding with 
an acknowledgment to Franck, was published in the middle of July, 1925. 

Interestingly, however, Davisson, whose attention was drawn to this 
paper, “did not think much of [it] because he did not believe that El¬ 
sasser’s theory of his (Davisson’s) prior results was valid.” 213 In fact, 
Elsasser’s note seems to have had little direct influence on Davisson’s 
subsequent work which so brilliantly vindicated de Broglie’s hypothesis. 
What started this famous series of experiments was rather a peculiar acci¬ 
dent which occurred in Davisson’s laboratory in April, 1925. In the 
course of an experiment on the angular distribution of electrons scattered 
by a target of ordinary (polycrystalline) nickel “a liquid-air bottle ex¬ 
ploded at a time when the target was at a high temperature; the experi¬ 
mental tube was broken, and the target heavily oxidized by the inrushing 
air. The oxide was eventually reduced and a layer of the target removed by 
vaporization, but only after prolonged heating at various high tempera¬ 
tures in hydrogen and in vacuum.” 214 When Davisson and Germer re¬ 
sumed their experiments with the nickel target, which, because of recrys¬ 
tallization during the heating, had become a single crystal (or rather a set 
of a small number of such crystals), they noticed that the angular distribu¬ 
tion “completely changed.” Thus, when monoenergetic electrons were 
normally directed against the (111) planes of the target, the pattern 
showed for a critical electron energy of 54 eV a distinct lobe at a Bragg 
angle q> = 50° and the lobe disappeared when the electron-accelerating 
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voltage was appreciably changed. Since the grating constant for the nickel 
crystal is a = 2.15 A, Bragg’s diffraction formula yielded a wavelength 
A = 1.65 A, which agreed perfectly with its value obtained from de Brog¬ 
lie’s formula A = h /mvzz(\50/V) l/2 A for the accelerating potential 
V = 54 volts. 

Until these results were obtained, however, it had taken almost another 
two years after the historic explosion which, as Darrow put it, “blew open 
the gate to the discovery of electron-waves.” 215 “In 1926 Davisson had the 
good fortune to visit England and attend the meeting of the British Associ¬ 
ation for the Advancement of Science at Oxford. He took with him some 
curves relating to the single crystal, and they were surprisingly feeble. He 
showed them to Bom, to Hartree and probably to Blackett. Bom called in 
another Continental physicist (possibly Franck) to view them, and there 
was much discussion on them. On the whole of the westward transatlantic 
voyage Davisson spent his time trying to understand Schrodinger’s pa¬ 
pers, as he then had an inkling (probably derived from the Oxford discus¬ 
sions) that the explanation might reside in them. In the autumn of 1926 
Davisson calculated where some of the beams ought to be, looked for 
them and did not find them. He then laid out a program of thorough 
search, and on 6 January 1927, got strong beams due to the line-gratings of 
the surface atoms, as he showed by calculation in the same month.” 216 

Meanwhile E. G. Dymond, at the Palmer Laboratory of Princeton 
University, examined the scattering of electrons by helium at various in¬ 
clinations to the primary beam. Having passed through two slits after the 
scattering, the electrons were deflected by a magnetic field into a collector, 
the field being adjusted so that electrons of only a particular velocity were 
collected. Dymond concluded that “the occurrence of...maxima is strong¬ 
ly suggestive of an interference pattern, as suggested by Elsasser.” 217 

In May, 1927, George Paget Thomson, with the assistance of his re¬ 
search student Andrew Reid, completed in Aberdeen the first series of 
diffraction experiments of electrons by thin films. 218 Thus, while Davis¬ 
son’s experiments were analogous to Laue’s method of x-ray diffraction, 
Thomson’s resembled the Debye-Hull-Scherrer technique. Thomson 
made a beam of monoenergetic cathode rays pass normally through a thin 
film of celluloid. The rays were then received on a photographic plate 
which, after being developed, showed a symmetrical pattern similar to 
those observed with x-rays. Another collaborator of Thomson’s, R. Iron¬ 
side, 219 performed similar experiments with films of copper, silver, and 
tin, metals which crystallize in face-centered cubes, while Seishi Kiku- 
chi 220 at the University of Tokyo used mica for this purpose. Summarizing 
these and his own experiments, Thomson could declare: “Cathode rays 
behave as waves of wavelength h /mv according to de Broglie’s theory of 
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wave mechanics.” 221 Likewise, Davisson, at a meeting of the American 
Optical Society in Washington on November 2, 1928, could announce: 
“During the last few years we have come to recognize that there are cir¬ 
cumstances in which it is convenient, if not indeed necessary, to regard 
electrons as waves rather than as particles, and we are making more and 
more frequent use of such terms as diffraction, reflection, refraction and 
dispersion in describing their behavior.” 222 In fact, since 1928 experimen¬ 
tal evidence for the correctness of de Broglie’s ideas increased enormous¬ 
ly. We mention only Rupp’s important experiments on electron diffrac¬ 
tion by mechanically ruled gratings, 223 the experiments performed by 
Estermann and Stem 224 and by Estermann, Frisch, and Stem 225 on the 
diffraction of molecular beams of hydrogen and helium, and the more 
recent experiments, carried out by Fermi and Marshall 226 and Zinn, 227 
within the framework of the Manhattan Project, on the diffraction of 
neutron beams. 

On April 30,1897, “a date which may fittingly be taken as marking the 
dawn of modem particle physics,” 228 Joseph John Thomson announced 
before the Royal Institution of Great Britain his discovery of the electron, 
for the study of whose corpuscular properties, such as its charge-to-mass 
ratio, he was awarded the Nobel Prize in 1906. Reviewing his work on 
cathode rays, he declared in his Nobel Prize address: "... I wish to give 
an account of some investigations which have led to the conclusion that 
the carriers of negative electricity are bodies, which I have called cor¬ 
puscles....” 229 

Forty years later, in 1937, Davisson and George Paget Thomson 
shared the Nobel Prize “for the experimental discovery of the interference 
phenomena in crystals irradiated by electrons.” If physical reality “is 
what it is because it does what it does,” 230 then one may feel inclined to say 
that Thomson, the father, was awarded the Nobel Prize for having shown 
that the electron is a particle, and Thomson, the son, for having shown 
that the electron is a wave. But, precisely, what is the physical nature of 
the de Broglie waves whose reality has been so irrefutably demonstrated? 

De Broglie’s hypothesis which associated with every particle a wave 
was soon elaborated into a new theory of mechanics. If there are waves, it 
was argued, there must be a wave equation. What was now needed was the 
discovery of this wave equation. It was also assumed 231 that a theory of 
mechanics, based on such an equation, must include ordinary particle 
mechanics as a limiting case, just as relativistic mechanics contained New¬ 
tonian mechanics in the limit of velocities relative to which the velocity of 
light may be regarded as infinitely large, a relation symbolically expressed 
by oo , or just as undulatory optics contained geometrical optics in the 
limit of wavelengths which may be regarded as infinitely small if com- 
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pared with the dimensions of the apparatus, or symbolically for A->0. A 
rigorous proof that in this limit undulatory optics reduces to geometrical 
optics had been given before 1911, probably for the first time, by Debye, 232 
when he showed that the basic equation of undulatory optics, 
At/ + k 2 u = 0, can be approximated for large k by the eikonal equation 
| grad W\ = n, the fundamental equation of geometrical optics. 233 

There can be little doubt that Debye’s calculation must have been as 
follows. In the wave equation A u + n 2 k 0 2 u = 0 he put u = A exp ( ik 0 W) 
and treated A and W as slowly varying functions. Since k 0 is by assump¬ 
tion very large, he neglected in the resulting differential equation for W 
and A all lower powers of k 0 in comparison with the highest power occur¬ 
ring (k 0 2 ) and obtained (VW) 2 = n 2 . 

Now, it will be recalled that the eikonal equation is the optical analogue 
of Hamilton’s differential equation of ordinary mechanics. It should 
therefore be clear that what was needed for the establishment of the new 
mechanics was the analogue for mechanics of Debye’s limit consideration, 
but in the reverse direction, that is, an extension of Hamilton’s equation 
into a differential equation which determines waves of the kind envisaged 
by de Broglie. As we know from reliable sources, 234 Debye was in fact 
working along these lines. Also E. Madelung, 235 who—like so many oth¬ 
ers at that time—had serious doubts about the reality of atomic orbits, was 
trying to devise a “kind of wave theory of the atomic levels.” As is well 
known, however, it was Schrodinger who succeeded in constructing with¬ 
in a few months’ time practically the whole of what is now known as 
nonrelativistic wave mechanics. 

Erwin Schrodinger, like Heisenberg, graduated from a “classical gym¬ 
nasium,” where Latin and Greek were major subjects of instruction. His 
humanistic education greatly influenced his later views on science and life 
and laid the foundations for a profound interest in the problems of classi¬ 
cal and modem philosophy, as his writings eloquently attest. 236 From 
1906 until his Ph.D. in 1910 he studied at the University of Vienna, pri¬ 
marily under the guidance of Fritz Hasenohrl in theoretical physics, 
Franz Exner in experimental physics, and Wilhelm Wirtinger in math¬ 
ematics. In 1911 he became Exner’s assistant at the Institute. “The old 
Vienna Institute,” he said later, 237 “which had just mourned the tragic 
loss of Ludwig Boltzmann, the building where Fritz Hasenohrl and Franz 
Exner carried on their work and where I saw many others of Boltzmann’s 
students coming and going, gave me a direct insight into the ideas which 
had been formulated by that great mind. His line of thought may be called 
my first love in science. No other has ever thus enraptured me or will ever 
do so again.” That Boltzmann had indeed influenced Schrodinger greatly 
can be shown by a detailed analysis of Schrodinger’s early work on the 
kinetic theory of gases, statistical mechanics, and elasticity. 238 It is par- 
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ticularly interesting to note in retrospect how, in his papers on the dynam¬ 
ics of elastically bound mass points, he had struggled with the problem of 
atomicity versus continuity or phenomenology, as the latter approach was 
still called at that time. This problem, it will be recalled, engaged leading 
physicists and philosophers of science, such as Boltzmann, Mach, and 
Volkmann, in heated debates at the turn of the century. Referring to 
Boltzmann’s thought-provoking defense of atomicity, 239 Schrodinger 240 
wrote: “Atomicity has the additional task by whose accomplishment 
alone its superiority over the phenomenological theories can be estab¬ 
lished. It has to discover and predict conditions under which the differen¬ 
tial equations, based on conceptions of continuity, would lead—because 
of the really atomistic structure of matter—to evidently false conclu¬ 
sions.” In the paper to which Schrodinger referred, Boltzmann had asked: 
“Has not atomistics in its current form great advantages over the present¬ 
ly so fashionable phenomenology? Is it likely that in the foreseeable future 
phenomenology will produce a theory which has the same potentialities as 
atomicity?” 241 And Boltzmann continued: “Should it ever happen that a 
theory will be constructed which is as comprehensive as atomicity and 
which is based on foundations as intelligible and unassailable as those of 
Fourier’s theory on heat conduction, this would truly be an ideal.” 242 It 
would be interesting to know whether Boltzmann, were he alive in 1926, 
would have regarded Schrodinger’s wave theory of the atom, based on a 
“diffusion equation” a la Fourier, as such an ideal. 

In his early years Schrodinger was working in the physics of contin¬ 
uous media and acquired the mastery of eigenvalue problems which 
proved so valuable for his future work. He also studied intensively Ray¬ 
leigh’s Theory of Sound and, in particular, learned early what important 
role the notion of group velocity plays in the theory of vibrations, as his 
paper on the acoustics of the atmosphere 243 clearly shows. In addition, 
deeply interested in philosophy, he read the writings of Spinoza, Schopen¬ 
hauer, Mach, Richard Semon, and Richard Avenarius. 244 

In 1920 he left Vienna to assist Max Wien in Jena. Four months later he 
accepted an associate professorship at the Technische Hochschule in 
Stuttgart. Finally, after a short professorship in Breslau, he settled in 
Zurich in 1921 and stayed there until 1927, when he succeeded Planck in 
Berlin. 

About the external circumstances which made Schrodinger start his 
important work Debye made the following statement: “Then de Broglie 
published his paper. At that time Schrodinger was my successor at the 
University in Zurich, and I was at the Technical University, which is a 
Federal Institute, and we had a colloquium together. We were talking 
about de Broglie’s theory and agreed that we did not understand it, and 
that we should really think about his formulations and what they mean. 
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So I called Schrodinger to give us a colloquium, and the preparation of 
that really got him started. There were only a few months between his talk 
and his publications.” 245 

Debye’s choice, it seems, was not accidental. For Schrodinger—and 
this brings us to a second, more profound incitement—was very eager to 
study de Broglie’s ideas and their possible implications. Interested in sta¬ 
tistical mechanics, he had read Einstein’s paper 246 on the quantum theory 
of the ideal gas in which, as we have seen, Einstein expressed his belief that 
de Broglie’s conceptions “involve more than merely an analogy.” In fact, 
it was this remark which induced Schrodinger to study what he referred to 
at that time as the “de Broglie-Einstein undulatory theory, according to 
which a moving corpuscle is nothing but the foam on a wave radiation in 
the basic substratum of the universe.” 247 Schrodinger himself acknowl¬ 
edged frankly his intellectual indebtedness to Einstein. “By the way,” he 
wrote to Einstein on April 23, 1926, “the whole thing would not have 
started at present or at any other time (I mean, as far as I am concerned) 
had not your second paper on the degenerate gas directed my attention to 
the importance of de Broglie’s ideas.” 248 

Schrodinger once confided to Dirac how he began his work. Trying to 
generalize de Broglie’s waves to the case of bound particles, he “finally 
obtained a neat solution of the problem, leading to the appearance of the 
energy-levels as eigenvalues of a certain operator. He immediately applied 
his method to the electron in the hydrogen atom, duly taking into account 
relativistic mechanics for the motion of the electron, as de Broglie had 
done. The results were not in agreement with observation. We know now 
that Schrodinger’s method was quite correct, and the discrepancy was due 
solely to his not having taken the spin of the electron into account. How¬ 
ever, electron spin was unknown at that time, and Schrodinger was very 
disappointed and concluded that his method was not good and abandoned 
it. Only after some months did he return to it, and then noticed that if he 
treated the electron non-relativistically his method gave results in agree¬ 
ment with observation in non-relativistic approximation. He wrote up this 
work and published it in 1926, and in this delayed manner Schrodinger’s 
wave equation was presented to the world.” 249 

The statement “Only after some months did he [Schrodinger] return 
to it [his new method of treating the hydrogen spectrum as an eigenvalue 
problem]” referred undoubtedly to the time interval between his two 
attempts at solving the problem, the first under the impact of Einstein’s 
remarks concerning de Broglie’s hypothesis, and the second following 
Debye’s colloquium. And the effect of both these factors upon Schro¬ 
dinger cannot be doubted. 

There are, however, also other accounts for which the present author 
has been unable to find additional independent evidence. Thus, Victor 
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Henri, a former pupil of the physiologist Albert Dastre of the Sorbonne 
and since 1924 professor of physical chemistry at the University of Zurich, 
recalled—according to Edmond H. Bauer 250 —that, while visiting Paris, 
he received from Langevin a copy of “the very remarkable thesis of de 
Broglie”; back in Zurich and having not very well understood what it was 
all about, he gave it to Schrodinger, who after two weeks returned it to him 
with the words: “That’s rubbish.” When visiting Langevin again, Henri 
reported what Schrodinger had said. Whereupon Langevin replied: “I 
think Schrodinger is wrong; he must look at it again.” Henri, having 
returned to Zurich, told Schrodinger: “You ought to read de Broglie’s 
thesis again; Langevin thinks this is a very good work”; Schrodinger did 
so and “began his work.” 

De Broglie’s work gave Schrodinger an important clue for his whole 
approach, as Schrodinger himself admitted. As we have seen on page 248, 
de Broglie had shown that the Sommerfeld quantum condition can be 
interpreted as a statement concerning the number of wavelengths which 
cover the exact orbit of the electron around the nucleus. For Sommerfeld’s 
§p dq = nh and de Broglie’s p = h/ A implied that 

H dq - n 

To Schrodinger this equation immediately suggested the idea of a wave- 
theoretic eigenvalue problem. 

In the first part of his monumental paper “Quantization as a problem 
of proper values” 251 Schrodinger proposed what was later called “the 
Schrodinger time-independent wave equation.” By replacing in the Ham¬ 
ilton equation 

H (q,^)= E (5.65) 

the function 5, assumed to be separable, by K log xp, Schrodinger obtained 


H 



= E 


(5.66) 


where xp now stands for a product of single-valued functions, each depend¬ 
ing on only one of the coordinates q. Equation (5.66) can be written (at 
least in simple cases) as a quadratic form (in xp and its first derivatives) 
equated to zero. Schrodinger then replaced the quantum conditions by the 
following postulate: xp has to be a real, single-valued, twice continuously 
differentiable function for which the integral of the just-mentioned qua¬ 
dratic form over the whole configuration space (q space) is an extremum. 
The Euler-Lagrange equation, corresponding to this variational integral, 
is the wave equation. 
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For the hydrogen atom with potential energy — e 2 /r, where 
r= (x 2 + y 2 + z 2 ) 1/2 , the quadratic form turned out to be 


F=F(xyz^t^-^\ 

dx’dy’dz) 

= (V^) 2 -^£ + ^y = 0 


so that from the variational problem 

SJ = djjj F dx dydz = 0 

he obtained in the usual way the wave equation for xp, 

■ 2 ' 


w +^{ e + t )* =0 


and the condition 


J 


dfSx = 0 
dn 


(5.67) 


(5.68) 


(5.69) 


(5.70) 


where df is an element of the infinite closed surface over which the inte¬ 
gral is taken. 

To solve Eq. (5.69) Schrodinger introduced spherical polar coordi¬ 
nates, wrote ip = x( r ) u (0y<p)y and found that u is a surface harmonic P™ 
(cosO)cosm<poTP , T(cos0) sin m<p, where0<m</and/ = 0,1,2,.... The 
limitation to integral values followed from the postulated single-valued¬ 
ness of \p. In order to find the eigenfunctions and eigenvalues 252 of the 
radial equation for ^(r), he applied—with the assistance of Hermann 
Weyl—the Laplace transformation and obtained the following result: Eq. 
(5.69) or, as it is still called, the Euler equation of the variational problem, 
has solutions, which are everywhere single-valued, finite, and continuous 
and tend to zero at infinity as 1/r, for every positive value of E; for 
negative values of E , however, such solutions exist only if 
me 2 /[K( — 2mE) in ] is a real integer n. The discrete eigenvalue spec¬ 
trum thus turned out to be E = — me*/2K 2 n 2 > n = 1, 2,..., which for 
K = h /2 tt was precisely the Bohr energy spectrum of the hydrogen atom. 
Schrodinger concluded the first part of his paper with the following re¬ 
marks on the significance of \p. “One may, of course, be tempted to associ¬ 
ate the function \p with a vibrational process in the atom, a process possibly 
more real than electronic orbits whose reality is being very much ques¬ 
tioned nowadays. Originally, I also intended to lay the foundations for the 
new formulation of the quantum conditions in this more intuitive manner. 
But later I preferred to present them in the above neutral mathematical 
form because it exposes more clearly the essential point, which in my 
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opinion, is the fact that the mysterious ‘requirement of integralness’ 
(‘Ganzzahligkeitsforderung’) no longer enters into the quantization rules 
but has been traced, so to speak, a step further back by having been shown 
to result from the finiteness and single-valuedness of a certain space func¬ 
tion.” 253 Considering the atom as a system of vibrations, Schrodinger 
thought in a similar vein to have traced back Bohr’s frequency condition 
to the appearance of beats so that Bohr’s formula for the frequency of 
emission, v= (E x /h) — ( E 2 /h), expresses merely the fact, well-known 
from acoustics or from the electromagnetic wave theory (heterodyne fre¬ 
quency), that the beat frequency is equal to the difference of two simulta¬ 
neous characteristic frequencies of the emitter. “It is hardly necessary to 
point out,” wrote Schrodinger, “how much more gratifying it would be to 
conceive a quantum transition as an energy change from one vibrational 
mode to another than to regard it as a jumping of electrons. The variation 
of vibrational modes may be treated as a process continuous in space and 
time and enduring as long as the emission process persists.” 254 

The second part of the paper (second communication 255 ), which could 
have preceded the first part from the logical point of view, explained the 
general ideas which led its author to the formulation of what is here still 
called “undulatory mechanics” (“undulatorische Mechanik” 256 ), the 
term “wave mechanics” (“Wellenmechanik”) being used by Schrodinger 
for the first time only in the beginning of the third part. 257 Elaborating on 
Hamilton’s optical-mechanical analogy, as outlined above, Schrodinger 
considered the Hamilton equation 


dW^f dW\ TT , x A /c . lx 

dt V’ dq ) + 9 ~ 

where W = f (T — U) dt is the action function or Hamilton’s principal 
function, that is, the time integral of the Lagrangian L = T — U, taken 
along a path of the system and regarded as a function of the end points and 
the time. Taking, as usual, W= — Et + S(q), where S is Hamilton’s 
characteristic function, he obtained the equations 

\VW\ 2 = 2m(E-U) (5.72) 

and 


dW 

dt 


E 


(5.73) 


which show 258 that the wave fronts of constant action W travel through 
space with the phase velocity 



E 

[: 2m{E- U)] U2 


(5.74) 



262 


Conceptual Development of Quantum Mechanics 


The particle velocity v , however, is clearly [2m(E — U) ] 1/2 . This dif¬ 
ference between V and v, Schrodinger maintained, was the reason for the 
failure of any attempt at establishing, prior to de Broglie’s discovery, an 
undulatory mechanics as an extension of ordinary mechanics in analogy 
to the generalization of geometrical optics to undulatory optics. “Impor¬ 
tant conceptions in wave theory, such as amplitude, wavelength, and fre¬ 
quency, or—more generally—the wave form , do not at all appear in the 
analogy, as there exists no mechanical parallel; nor is any reference made 
to a wave function. Even the definition of W as signifying the phase is not 
sufficiently clear in the absence of any definition of the wave form. If we 
regard the whole analogy merely as a convenient means of picturization,” 
Schrodinger continued, 259 “then the defect is not disturbing, and we 
would consider any attempt at improving upon it as an idle trifling, as¬ 
suming that the analogy holds precisely with geometrical optics, or, if we 
wish, with a very primitive form of wave optics, and not with the full- 
fledged undulatory optics. That geometrical optics is only a rough ap¬ 
proximation for light does not change the situation. To preserve the analo¬ 
gy for the further development of the optics of q space in accordance with 
a wave theory, we have to be careful not to depart markedly from the 
limiting case of geometrical optics; that is, we must choose the wavelength 
sufficiently small, namely, small compared with all the path dimensions. 
But then the additions do not teach anything new. The picture is only 
draped with unnecessary ornaments. 

“So we might reason to begin with. But even the first attempt to con¬ 
struct an analogy to the wave theory leads to such striking results, that a 
quite different suspicion arises: for today we do know that our classical 
mechanics breaks down for very small dimensions and very great curva¬ 
tures of path. Perhaps this failure corresponds exactly to the failure of 
geometrical optics, that is, ‘the optics of infinitely small wavelengths,’ if 
the obstacles or apertures are no longer great compared with the real, 
finite wavelength. Our classical mechanics is perhaps the complete analo¬ 
gy of geometrical optics and as such is wrong and not in agreement with 
reality; it fails whenever the radii of curvature and the dimensions of the 
path are no longer great compared with a certain wavelength, which has a 
real meaning in q space. Then an undulatory mechanics has to be estab¬ 
lished, and the most obvious approach to it is the elaboration of the Ham¬ 
iltonian analogy into a wave theory.” 

Assuming that for the waves of such an undulatory mechanics the 
phase velocity is given by Eq. (5.74) and the time dependence by a factor 
exp (2m(E /h)t)> Schrodinger obtained 

A y h 

v [2 m(E-U)]' n 


(5.75) 
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for the wavelength in accordance with de Broglie’s result. From the as¬ 
sumption 'P = ip(q,t) =$(q) exp (2m(E/h)t), where i(/(q)=i(/ is a 
space function and E /h the frequency v, and from the general differential 
equation for wave phenomena 

A = 0 (5.76) 

he deduced the fundamental equation 

o_2 m 

A ^ + - 2 T? L ( £ - u )'I' = 0 (5.77) 

h 

in agreement with his previous result (5.69). 

Schrodinger attached great importance to the fact that the particle 
velocity v is equal to the group velocity g, as the application of the formula 
g = dv/d(v/V) immediately shows. For, as he declared, “this fact can be 
used to establish a much more intimate connection between wave propa¬ 
gation and the motion of the representative point than was possible before. 
We can attempt to construct a wave group of relatively small dimensions 
in every direction. Such a wave group will presumably obey the same laws 
of motion as the image point of the mechanical system. It will then give, so 
to speak, a substitute (‘Ersatz’) of the image point as far as its dimensions 
can be ignored, that is, as far as we can neglect any spreading out in 
comparison with the dimensions of the path of the system.” 260 The notion 
of wave groups or wave packets, as they were later called, had been used in 
optics as early as 1909 by Debye, 261 and a few years later by von Laue, 262 
for an exact analytical representation of cones of rays. It now enabled 
Schrodinger to link his undulatory mechanics with ordinary particle me¬ 
chanics. “The true mechanical process,” he explained, “will be realized 
and represented appropriately by the wave processes in q space, and not by 
the motion of image points in this space. The study of the motion of image 
points, which is the object of classical mechanics, is merely an approxima¬ 
tion and has, as such, as much justification as geometrical optics or ‘ray- 
optics’ has compared with the true optical process. A macroscopic me- 

m 

chanical process will have to be represented as a wave signal of the kind 
described above, which can approximately enough be regarded as point¬ 
like compared with the geometrical structure of the path. We have seen 
that for such a signal or wave group precisely the same laws of motion are 
valid as those advanced by classical mechanics for the motion of the image 
point. This procedure, however, becomes meaningless when the structure 
of the path is no longer very large compared with the wavelength or indeed 
is comparable with it. Then we must proceed strictly according to the 
wave theory, that is, we must proceed from the wave equation , and not 
from the fundamental equation of mechanics, in order to include all possi- 
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ble processes. These latter equations are just as useless for the explanation 
of the microstructure of mechanical processes as geometrical optics is for 
the explanation of diffraction .” 263 

In conclusion, Schrodinger applied his theory to the linear harmonic 
oscillator, to the rigid rotator with a fixed axis, and to that with a free axis, 
as well as to the vibrational rotator (diatomic molecule). For Planck’s 
oscillator he obtained as eigenfunctions the Hermitian orthogonal func¬ 
tions exp ( — x 2 /2)H n (jc), where H n (jc) is the nth Hermitian polyno¬ 
mial, and as its eigenvalues E n = (n + \)hv , n = 0,1,2,..., in full agree¬ 
ment with Heisenberg’s matrix-mechanical result. 264 

For the sake of continuity we shall now turn briefly to the two remain¬ 
ing parts of the paper (the third and fourth communications) and post¬ 
pone the discussion of another of Schrodinger’s papers which chronologi¬ 
cally preceded these. 

Schrodinger had already indicated at the end of the second part that the 
applicability of the new theory extended considerably beyond the “direct¬ 
ly soluble” problems and that approximate solutions could be obtained for 
conditions which are sufficiently near to those of soluble problems. The 
third communication, now, contains a detailed exposition of what became 
known as the Schrodinger theory of time-independent perturbations. 
Schrodinger regarded his approach as an extension of a method which 
Rayleigh introduced in his study of acoustical vibrations. In fact, Ray¬ 
leigh prefaced his treatment of the vibrations of a stretched string with 
variable linear density with these words: “The rigorous determination of 
the periods and types of vibration of a given system is usually a matter of 
great difficulty, arising from the fact that the functions necessary to ex¬ 
press the modes of vibration of most continuous bodies are not as yet 
recognized in analysis. It is therefore often necessary to fall back on meth¬ 
ods of approximation, referring the proposed system to some other of a 
character more amenable to analysis, and calculating corrections depend¬ 
ing on the supposition that the difference between the two systems is 
small. The problem of approximately simple systems is thus one of great 
importance, more especially as it is impossible in practice actually to rea¬ 
lize the simple forms about which we can most easily reason.” 265 This 
statement, with the appropriate change of interpretation, was fully en¬ 
dorsed by Schrodinger, who generalized 266 Rayleigh’s method in two re¬ 
spects: the coefficients in the differential equations were no longer, as in 
Rayleigh’s treatment, necessarily constant and the procedure was ex¬ 
tended to apply also to degenerate cases. 

Schrodinger assumed that the Sturm-Liouville eigenvalue problem 


L\f) + Eip — 0 


(5.78) 
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where L is a self-adjoint linear differential operator of the second order, 267 
has for the k th stationary state the normalized eigenfunction solution ip k . 
Taking as the equation for the perturbed case 

Ltp + E\jf = Xnp (5.79) 

where X is the perturbation parameter and r is a function of the coordi¬ 
nates, characteristic for the perturbation, Schrodinger put 

E = E k +Ae k $=il> k +X<p k (5.80) 

where E k and ip k are the eigenvalue and eigenfunction of the unperturbed 
problem and e k and <p k their “corrections,” respectively, for the perturbed 
case. By substitution, neglecting terms in A 2 , he obtained 

L<p k +E<p k = (r-e k )ip k (5.81) 

Since the nonhomogeneous equation (5.81) has a continuous solution 
only if its right-hand side is orthogonal to the solution xfj k of the homoge¬ 
neous equation (5.78), he concluded that 


Ck 



(5.82) 


and thus obtained the wave-mechanical analogue of the classical theorem 
that the first-order energy correction is equal to the mean value of the 
perturbation function averaged over the unperturbed solution. 268 To find 
<p k Schrodinger expanded cp k and the right-hand side of Eq. (5.81), after 
having substituted in it the value of e k , in terms of the unperturbed eigen¬ 
functions i/r i9 which he assumed to form a complete orthonormal system. 
A straightforward calculation yielded 


<Pk = 'Z 

i 

i*k 


'P i Srj> k i/> i dx 
E k - E, 


(5.83) 


After pointing out that the present procedure could easily be extended 
to higher orders of approximation, Schrodinger generalized his method 
for degenerate problems and obtained, in a way well known to every stu¬ 
dent of quantum mechanics, a secular equation whose roots provide the 
required energy corrections. 

In the remainder of the paper Schrodinger applied his perturbation 
theory to the Stark effect of the hydrogen atom in two different ways. 
First, following Schwarzschild and Epstein, he separated the wave equa¬ 
tion in parabolic coordinates and then applied his approximation method 
which led him precisely to the Schwarzschild-Epstein formula, as given 
on page 109; in the second method, applying his perturbation theory liter¬ 
ally as proposed, he expanded the perturbation term, containing the rela- 
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tively small external field F, in terms of the eigenfunctions of the Kepler 
problem. He also calculated the intensities of the components of the Stark 
effect, using the eigenfunctions of the zeroth approximation and the wave¬ 
lengths of the first-order approximation, and obtained results in fair 
agreement with observation. 269 

In the fourth communication, the last part of the paper, Schrodinger 
developed his theory of time-dependent perturbations. Realizing that in 
Eq. (5.77) the eigenvalue E varies in passing from one stationary state to 
another and consequently must not occur in the “real wave equation” 
(“eigentliche Wellengleichung”) which is to determine the spatial-tem¬ 
poral behavior of the wave function 'P = i(/(x,y,z,t ), Schrodinger eliminat¬ 
ed is by means of the equation 'P = xp(q) exp [2m(E /h)t] and obtained 


—V- + IN/ = 

877 2 m 


h dV 
2m dt 


(5.84) 


Schrodinger postulated the validity of Eq. (5.84), later called “the Schro¬ 
dinger time-dependent wave equation,” also for time-dependent potential 
functions U. Recognizing that Eq. (5.84) has the structure of a diffusion 
equation with an imaginary diffusion coefficient, Schrodinger relaxed his 
original requirement concerning the reality of xj; and admitted complex¬ 
valued functions for what he called the “mechanical field scalar t/r” As an 
application of the time-dependent wave equation he studied the phenome¬ 
non of dispersion, which he treated as a problem involving a periodically 
varying perturbing potential-energy function U. His well-known first-or¬ 
der approximation result showed that the secondary radiation contains 
the combination frequencies calculated by Kramers and Heisenberg 270 
and previously predicted by Smekal. 271 Schrodinger concluded his paper 
with a discussion on the physical significance of xft. He interpreted xfjxfj* as a 
weight function in configuration space which accounts for the electrodyn¬ 
amical fluctuations of the electric space density of the charges. He de¬ 
clared: “The xp function is to do no more and no less than to offer us a 
survey and mastery over these fluctuations by a single differential equa¬ 
tion. It has repeatedly been pointed out that the xp function itself cannot 
and may not in general be interpreted directly in terms of three-dimen¬ 
sional space—however much the one-electron problem seems to suggest 
such an interpretation—because it is in general a function in configuration 
space and not in real space.” 272 

Schrodinger’s brilliant paper was undoubtedly one of the most influen¬ 
tial contributions ever made in the history of science. It deepened our 
understanding of atomic phenomena, served as a convenient foundation 
for the mathematical solution of problems in atomic physics, solid state 
physics, and, to some extent, also in nuclear physics, and finally opened 
new avenues of thought. In fact, the subsequent development of nonrela- 
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tivistic quantum theory was to no small extent merely an elaboration and 
application of Schrodinger’s work. Interested as we are in the foundations 
of quantum mechanics, we shall confine our discussion to only those 
elaborations which had a bearing on the conceptual development of the 
theory. 

One of such basic problems, raised, as we have seen, by Schrodinger at 
the very beginning of his paper, was the question of exactly what require¬ 
ments an admissible xp function must satisfy. The reality requirement, as 
mentioned, was soon abandoned. In 1930 G. Jaffe, 273 insisting that only 
physical considerations can serve as a criterion on this issue, contended 
that xp must be single-valued, finite, and continuously differentiable 
throughout configuration space and must possess a second derivative ev¬ 
erywhere except at singular points where the potential is discontinuous. 
That these requirements were too stringent was shown in 1931 by R. M. 
Langer and N. Rosen. 274 Arguing that the ultimate criterion for the ad¬ 
missibility of xp is the finiteness of the variational integral J in Eq. (5.68), 
they declared: “As a rough working rule we may demand of the function 
that it be integrable in the square and that it be finite and continuous 
wherever the potential energy is finite. If we introduce singularities (phy¬ 
sically non-existent) in the potential energy we must be prepared to put up 
with singularities in the wave function at those points. When, in the fu¬ 
ture, we get a deep enough insight into the nature of the physical problems 
so that we can replace the singular values of the potential energy by their 
true finite values, the singularities of the wave functions will also be re¬ 
moved. But until then we must be content with results which differ from 
the facts to the same degree as do the assumptions from which we start.” A 
few weeks later, E. H. Kennard 275 claimed, on the basis of Bom’s probabi¬ 
listic interpretation of xp, which we shall discuss in due turn, that since xpxp* 
must always be integrable in order that the total probability may be unity, 
the xp functions themselves must be quadratically integrable (or, in the 
continuous-spectrum case, Weyl-normalizable). “The customary re¬ 
quirements of continuity and single-valuedness of xp or its derivatives are 
unnecessary as an addition to the fundamental requirement that xp shall 
satisfy a certain differential equation. The condition that xp shall be finite 
everywhere, which serves so well in atomic theory, is in almost all cases 
equivalent to the requirement that xpxp* shall be integrable.” The basic 
requirement for xp , according to Kennard, is that “it shall constitute one of 
such an orthogonal family in terms of which we can expand any xp that can 
occur in Nature.” 276 

This brings us to the problem of the completeness of the eigenfunctions, 
a problem whose basic importance was fully recognized by Schrodinger— 
especially in connection with the perturbation theory, where it was sup¬ 
posed that the eigenfunctions of the unperturbed case form a complete set. 
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Schrodinger’s use of the Laplace transformation for the solution of the 
radial equation in the Kepler problem had the disadvantage, as he ad¬ 
mitted himself, 277 of not showing that the set of the obtained eigenfunc¬ 
tions is complete. In 1927 Eddington 278 showed that the radial equation 
can be solved in a more elementary way and that its solution is, in fact, 
implicitly contained in many standard treatises such as that by Whittaker 
and Watson. 279 In 1928 another short and elementary process for the 
solution of the radial equation was presented to the American Mathemat¬ 
ical Society by T. H. Gronwall. 280 Gronwall also proved the completeness 
of the eigenfunctions of the hydrogen atom, utilizing the closure property 
of the spherical harmonics. 281 

Also Schrodinger’s postulate concerning the single-valuedness of the 
wave functions soon became a matter of dispute. Schrodinger had ar¬ 
gued 282 that /, the wave-mechanical analogue of the azimuthal quantum 
number in the Bohr-Sommerfeld theory, and m, the analogue of the mag¬ 
netic quantum number, are necessarily integral since otherwise no 
uniquely determined dependence of the wave function on the spatial co¬ 
ordinates is assured. The question, expressed in a more modem formula¬ 
tion, was whether the eigenvector representative of the operator L zi the z 
component of the orbital angular momentum, is necessarily single-valued. 
The usual argument is based on Schrodinger’s postulate, for it reasons as 
follows. Since the eigenfunction, say, exp ( imq>) 9 of the operator L z 
= (h /2iri) (d/d<p) has to be single-valued, or exp [im(<p + 2ir)] = 
exp( imq ?), m must be integral. However, as soon as it was recognized that 
in the theory of particles with intrinsic spin double-valued functions have 
to be admitted, Schrodinger’s argument seemed specious. To be sure, the 
requirement of single-valuedness remained valid for observable quantities 
such as the probability density, but not for probability amplitudes such as 
the Schrodinger functions. 

The first to examine this question more closely was Pauli. 283 As a crite¬ 
rion he proposed to admit only those eigenfunctions which, if subjected to 
the angular-momentum operator, yield new functions belonging to the 
same total angular momentum j, i.e., the new functions so obtained should 
be expressible as linear combinations of the original solutions. This crite¬ 
rion, as Pauli could easily show, excluded the possibility of double-valued 
orbital angular-momentum eigenfunctions. An appeal to experiment for a 
decision on this issue was made by David Bohm, 284 who confined the 
theory to the weaker result, according to which orbital angular eigenval¬ 
ues are either all integral or all half odd-integral, but added: “Experiment 
shows that only integral orbital momenta are actually present.” Two fam¬ 
ous footnotes in Blatt and Weisskopf’s treatise on theoretical nuclear 
physics 285 revived the interest in this problem. Calling Schrodinger’s rea- 
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soning “fallacious,” these authors excluded half odd-integral values in 
view of certain unpublished results obtained by A. M. Nordsieck. For 
Nordsieck had shown that the admission of half odd-integral values 
would lead to the physically unacceptable existence of a probability cur¬ 
rent density from one pole of a sphere, centered at the origin of the coordi¬ 
nate system, to the other, the two poles acting as source and sink, respec¬ 
tively. Another way to exclude these half odd integers, without referring 
to the Schrodinger representation and his argumentation, was proposed 
by H. E. Rorschach at the 1962 Southwestern Meeting of the American 
Physical Society. 286 With the help of dynamical variables which have the 
character of true vectors (such as r and p) and which anticommute with 
the parity operator, he constructed “stepping” operators which lower or 
raise the / value of state functions and which, by iteration, lead to algebraic 
equations yielding for the minimum value of / the number zero. In the 
same year Merzbacher, 287 who regarded the single-valuedness under dis¬ 
cussion as resulting from the imposition of boundary conditions, attempt¬ 
ed to connect the problem with the Aharonov-Bohm quantum effect 288 in 
order to enforce a criterion. Buchdahl, 289 on the other hand, resorted to 
the harmonic-oscillator representation in order to show, “by a straightfor¬ 
ward and elementary argument” and without drawing explicitly upon 
conditions of single-valuedness, that the eigenvalues of the components of 
the orbital angular momentum are integral. We have purposely gone into 
these details—for their full comprehension the reader is referred to the 
original literature—to accentuate the interesting fact that Schrodinger’s 
basic assumptions concerning the mathematical nature of iff, quite apart 
from the problem of its physical interpretation, have engaged theoretical 
research ever since they were published. The recent renewed interest in 
the problem of the admissibility conditions for wave functions is, of 
course, also due to the fact that the behavior of field operators in quantum 
field theory depends on the behavior of the wave functions from which 
they are formed. 

After this digression let us return to 1926, when Schrodinger’s deriva¬ 
tion of the “integralness” and discreteness of the energy spectrum from 
the postulated single-valuedness and finiteness of certain space functions 
surprised the world of physics. To be sure, situations of this kind were not 
completely unknown at that time, for boundary-value problems, such as 
that of the stretched string, the tensed membrane or the ball of fluid con¬ 
fined to a spherical shell, had long been solved in mathematical physics. 
Still, it was striking to see how the enforcement of what seemed complete¬ 
ly nonarbitrary conditions led to the same results as the mathematically so 
unfamiliar and conceptually so sophisticated formalism of matrix me¬ 
chanics and its variations. After the conceptual cataclysm evoked by the 
latter it seemed as if Schrodinger’s return to quasi-classical conceptions 
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reinstated continuity. Those who in their yearning for continuity hated to 
renounce the classical maxim natura non facit saltus acclaimed Schro- 
dinger as the herald of a new dawn. In fact, within a few brief months 
Schrodinger’s theory “captivated the world of physics” because it seemed 
to promise “a fulfillment of that long-baffled and insuppressible de¬ 
sire.” 290 Einstein was “enthusiastic” 291 about it, Planck reportedly de¬ 
clared, “I am reading it as a child reads a puzzle,” 291 and Sommerfeld was 
exultant. 291 Gottingen, however, at that time, was rather reserved. 292 Ein¬ 
stein’s enthusiasm is not surprising, for he thought he found in Schro- 
dinger’s theory the answer to a question which for quite some time had 
occupied his mind. As early as 1920 he had written to Bom: “That one has 
to solve the quanta by giving up the continuum, I do not believe.... I am 
rather convinced now as before that one has to search instead for an 
overdetermination by differential equations so that their solutions are no 
longer of the continuous type. But how?” 293 

It is instructive to compare Schrodinger’s wave mechanics with Hei¬ 
senberg’s matrix mechanics. It is hard to find in the history of physics two 
theories designed to cover the same range of experience, which differ more 
radically than these two. Heisenberg’s was a mathematical calculus, in¬ 
volving noncommutative quantities and computation rules, rarely en¬ 
countered before, which defied any pictorial interpretation; it was an alge¬ 
braic approach which, proceeding from the observed discreteness of 
spectral lines, emphasized the element of discontinuity ; 294 in spite of its 
renunciation of classical description in space and time it was ultimately a 
theory whose basic conception was the corpuscle . Schrodinger’s, in con¬ 
trast, was based on the familiar apparatus of differential equations, akin to 
the classical mechanics of fluids and suggestive of an easily visualizable 
representation; it was an analytical approach which, proceeding from a 
generalization of the classical laws of motion, stressed the element of con¬ 
tinuity; and, as its name indicates, it was a theory whose basic conception 
was the wave. Arguing that the use of multidimensional ( > 3) configura¬ 
tion spaces and the computation of the wave velocity from the mutual 
potential energy of particles is “a loan from the conceptions of the corpus¬ 
cular theory,” Heisenberg 295 criticized Schrodinger’s approach as “not 
leading to a consistent wave theory in de Broglie’s sense.” In a letter to 
Pauli he even wrote: “The more I ponder about the physical part of Schro¬ 
dinger’s theory, the more disgusting (‘desto abscheulicher’) it appears to 
me.” Schrodinger was not less outspoken about Heisenberg’s theory when 
he said: “...I was discouraged (‘abgeschreckt’), if not repelled (‘abgestos- 
sen’), by what appeared to me a rather difficult method of transcendental 
algebra, defying any visualization (‘Anschaulichkeit’)” 296 

In spite of these fundamental disparities Schrodinger was always con¬ 
vinced that these two approaches do not clash but rather complement 
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each other. 297 In fact, in the early spring of 1926—prior to the publication 
of the third communication of his magnum opus—Schrodinger discov¬ 
ered what he called “a formal, mathematical identity” of wave mechanics 
and matrix mechanics. Schrodinger’s proof 298 of the intertranslatability 
of either formalism into the other proceeded as follows. Recognizing that 
the Bom-Heisenberg matrix relation (5.26) 


G 


>- 


T $ 


pq - qp = — 1 

2m 

corresponds to the wave-mechanical relation 

h_J \ (J}_JL 

2m dq / \2m dq J 2m 

Schrodinger associated with every physical function F=F(p,q) of the 
variables p and q the differential operator 

f(JlA 

\2m dq 

which he denoted by [F, •]. If the functions u k = u k (q) form in the 
configuration space of the q a complete orthonormal set, for any wave 
function xf; = tf/(q) and tf;' = [F, ifr] 


*) 


ip=Ya k u k (5.85) 

and 

= ^a'jUj (5.86) 

j 

where 

a'j = f uf[F,xjj]dq 

Hence, 

a} = ^ F jk o k (5.87) 

k 

where the elements of the matrix (F jk ) are given by the equation 

F Jk =juf[Fu k ]dq (5.88) 

Equation (5.88) associates—relative to a given basis u k —with every 
function F(p f q) a matrix (F jk ) and this association, as Schrodinger 
showed, satisfies the condition that the matrix associated with the sum 
(product) of two functions is the sum (product) of the matrices associat- 



272 


Conceptual Development of Quantum Mechanics 


ed with the individual terms (factors). Any wave-mechanical equation 
could therefore consistently be translated into a matrix equation, the oper¬ 
ation of F on a wave function xf; corresponding to the application of the 
matrix ( F jk ) on the column vector ( a k ) whose components are the Four¬ 
ier coefficients of i/;. In the second part of the paper Schrodinger specified 
the basic w, by taking in their place the eigenfunctions $ of the wave 
equation. The latter could be written as an operator equation 

[H,t/>]=Eif> (5.89) 

where [H, • ] is the Hamiltonian operator, obtained from the classical 
prototype H(p,q) just as [F t - ] from F(p t q). Its eigenvalues are E k , satis¬ 
fying 

[H,t(, k ]=E k t(, k (5.90) 

Referring now to the matrix-mechanical laws of motion (5.28), Schro¬ 
dinger demonstrated that with respect to the specified basis the matrix H 
is diagonal, since H jk = ] dq = E k 8 jk , or, in other words, solv¬ 

ing (5.89) is equivalent to diagonalizing the matrix H. With the help of 
the auxiliary theorems 

dF 2 tti 

-fv =^[pF-Fp,-] (5.91) 

Voq \ h 

dF 2m 

^ =£ZL[Fq-qF 9 -] (5.92) 

op h 

he showed that the matrices constructed in accordance with Eq. (5.88) do 
in fact satisfy the laws of motion (5.28). 

To complete the proof of formal equivalence, Schrodinger had to show 
not only that the matrices can be constructed from the eigenfunctions but 
also, conversely, that the functions can be obtained from the numerically 
given matrices. “Thus the functions do not form, as it were, an arbitrary 
and particular ‘fleshly clothing’ (‘fleischliche Umkleidung’) of a bare 
matrix skeleton, provided to pander to the need of visualization.” 299 To 
show that such an epistemological superiority of matrix mechanics has no 
justification, Schrodinger supposed that in the equations 

9jk=\UjU k dq (5.93) 

the left-hand sides are numerically given and the functions Uj are to be 
found. For this purpose he pointed out that the integrals 

J P(q)Uj(q)u k (q) dq 

where P(q) signifies any power product of the #’s can be computed by 
matrix multiplication. But the totality of these integrals, for fixed j and k, 
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forms what is known as the “moments’’ of the functions u } u k . Since it is 
well known that under very general assumptions the totality of such mo¬ 
ments determines uniquely the functions themselves, all the products 
UjU kt and hence also all the squares u 2 , and finally the Uj themselves are 
ascertainable. 

“Today,” declared Schrodinger, “there are many physicists who, like 
Kirchhoff and Mach, regard the task of physical theory as being merely a 
mathematical description, as economical as possible, of the empirical rela¬ 
tions between observable quantities, that is, as a description which repre¬ 
sents the relations as far as possible without the intervention of unobserva¬ 
ble elements. On this view, mathematical equivalence and physical 
equivalence have almost the same meaning.” 300 In other words, if a phys¬ 
ical theory were identical with the set of its mathematical relations, just as 
Hertz once identified Maxwell’s theory with the system of Maxwell’s 
equations, wave mechanics and matrix mechanics, according to Schro¬ 
dinger, would be just one theory. 301 

Judged, however, from a strictly formal point of view, Schrodinger 
proved rigorously only that his wave mechanics implies most basic as¬ 
sumptions of Heisenberg’s matrix mechanics, but not that, conversely, 
matrix mechanics implies his wave mechanics. 3013 The credit of having 
been the first to suggest a mathematical approach to prove the full equiv¬ 
alence between these two theories—i.e. that either of them logically im¬ 
plies the other—goes probably to Wolfgang Pauli; for in a letter, written 
on 12 April 1926 to Pascual Jordan, he gave an outline of such a proof. 301b 
As will be shown below, 301c full clarification on this matter has been 
reached only by John von Neumann when he showed in 1929 that, ulti¬ 
mately due to the famous Riesz-Fischer theorem in functional analysis, 
the Heisenberg and Schrodinger formalisms are operator calculi on iso¬ 
morphic (isometric) realizations of the same Hilbert space and hence 
equivalent formulations of one and the same conceptual substratum. 

Schrodinger’s article of the formal equivalence of the two theories was 
received by the editor of the Annalen on March 18, 1926. On March 31 
Carl Eckart of the California Institute of Technology, whose interest 302 in 
quantum mechanics was aroused by Bom’s lecture in Pasadena in the 
winter of 1925, submitted to the National Academy of Sciences a paper 303 
whose declared purpose was “to show, in a purely formal way, that the 
Schrodinger equation must be the basis for the Bom, Jordan and Heisen¬ 
berg matrix calculus.... Lack of space prevents any general discussions,” 
wrote Eckart, “hence, only a very formal solution of the special problem 
of the single linear oscillator is attempted. All results are capable of imme¬ 
diate generalization, however. The general theory, and an attempt to in¬ 
terpret some of the postulates, will be published in another place.” 
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In fact, on June 7, 1926, Eckart completed his paper “Operator calcu¬ 
lus and the solutions of the equations of quantum dynamics,” 304 in which 
he generalized his former result by constructing an operator calculus for 
classical dynamics, extending it to quantum dynamics and showing that 
“the remarkable results of Schrodinger have been included in the same 
calculus as those of Bom and Jordan.” 305 This conclusion, Eckart de¬ 
clared, “would seem to be the strongest support which either of the two 
widely dissimilar theories have thus far received.” 306 Eckart’s approach, 
basically a special case of Schrodinger’s method, consisted in using the 
normalized eigenfunctions of the linear harmonic oscillator 307 for the con¬ 
struction of matrices. His calculation yielded 
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n + 1 ,n 


(n + 1)/* 
4irma) 




nh 

\irmcd 


(5.94) 


or, as Eckart said, “the matrices of the Bom-Jordan theory are really 
obtained.” 

As explicitly acknowledged, Eckart was led to the idea of constructing 
the matrices by integrations of orthonormal eigenfunctions, subjected to 
differential operators, by an interesting paper by Lanczos. 308 For in De¬ 
cember, 1925, Komel Lanczos had shown that the Heisenberg-Born-Jor¬ 
dan theory can be formulated not only in the language of matrices but 
also, equivalently, in that of integral equations. In fact, Lanczos’ reformu¬ 
lation of Heisenberg’s mechanics in terms of integral equations preceded 
Schrodinger’s first communication by about four weeks and was there¬ 
fore, strictly speaking, the earliest continuum-theoretic formalism of 
quantum mechanics. Lanczos’ insistence on the conceptual advantages of 
such a continuous integral representation, as being more closely related to 
field conceptions, was little heeded at the time; the use of integral equa¬ 
tions with which physicists were—and still are—much less familiar than 
with differential equations, the absence of any specific example or new 
result, and certainly also the fact that the publication almost coincided 
with that of Schrodinger’s first communication, explain the relatively cool 
reception Lanczos’ paper was given. As a footnote in Schrodinger’s equiv¬ 
alence article indicates, 309 Schrodinger had carefully studied the Lanczos 
paper and, most probably, he was influenced by it as Eckart was. Schro¬ 
dinger, however, emphasized that the apparent similarities between his 
own and Lanczos’ paper are, so to speak, only on the surface; in particular, 
Lanczos’ symmetrical kernel K(s,a), whose orthogonal eigenfunctions 
served Lanczos to construct his matrices, should not be identified with the 
Green’s functions of the wave equation, for the eigenvalues of the former 
are the reciprocals of those of the latter. 
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For the sake of historical accuracy it should be noted that according to 
a statement made by Gregor Wentzel, 310 the mathematical equivalence 
between matrix and wave mechanics was independently established also 
by Wolfgang Pauli. 

With these remarks we conclude our presentation of Schrodinger’s 
remarkable papers, which were, as Bom 311 once said, “of a grandeur un¬ 
surpassed in theoretical physics.” Reviewing Schrodinger’s derivation of 
his wave equation, which according to Planck “plays the same part in 
modem physics as do the equations established by Newton, Lagrange, 
and Hamilton in classical mechanics,” 312 we see that Schrodinger’s rea¬ 
soning, as represented in his second communication, 313 was indeed inti¬ 
mately connected with Debye’s deduction of ray optics from wave optics. 
For a comparison of Eq. (5.72), page 261, with the eikonal equation 
\VW\ 2 = n 2 showed that n 2 = 2m (E — U) , in agreement with Hamilton’s 
“optical-mechanicaj analogy” (5.64), page 243. As soon as this value of 
n 2 was introduced into Debye’s wave equation Aip + n 2 k^ip = 0, as ex¬ 
plained on page 256, and k 0 was identified with 2ir/h, Schrodinger’s wave 
equation (5.77), page 263, was obtained. 

It seemed strange that Schrodinger’s solution of the hydrogen eigenval¬ 
ue equation resulted precisely in Bohr’s energy levels; for it was hard to 
understand how two methods so dissimilar as Bohr’s and Schrodinger’s 
could lead to identical results. 

The puzzle was solved by a new wave-mechanical approximation 
method which was proposed, shortly after the publication of Schro¬ 
dinger’s papers, by Wentzel 314 and, independently, by L. Brillouin 315 and 
improved by Kramers, 316 the so-called “WKB method.” 317 Our discus¬ 
sion of this method will be confined to the one-dimensional case, for which 
it is most readily applicable, and will deal with only those features which 
are directly relevant to our present subject. 

Writing Schrodinger’s equation (5.77) as 

if," + k 0 2 p 2 t(> = 0 (5.95) 

where, as before, k 0 = 2'ir/h and p 2 = 2m(E — U), Wentzel and Bril¬ 
louin wrote, just as Debye did, 

xfj = exp(ik Q W) (5.96) 

but specified in addition 

W= I ydx (5.97) 

Jxo 

where y is a function of x and x 0 is a fixed lower limit of integration. From 
Eqs. (5.95), (5.96), and (5.97) they obtained the Riccati differential 

. 'll o 

equation 
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and by expanding y in powers of A, 

y=y ° + h y,+ {'L) , ' 2+ "' 

and comparing the powers of h on both sides of Eq. (5.98), 


(5.98) 


(5.99) 


y 0 = ±p 

yo 

y i= 

2Vo 

and so on. 

The zeroth-order approximation W = fy 0 dx = J ±pdx corre¬ 
sponded, as we see, to the solution of classical mechanics while successive 
higher-order approximations were expected 319 to approach the rigorous 
solution of wave mechanics. Treating ( 2m/h)y = xp'/xp as an analytic 
function over the complex plane and integrating it along a closed contour 
around all of its poles (with residues 2m) y Wentzel concluded with the 
help of the theorem of residues that 


§ 


ip' , 2m r , _ . 

— dx =-(p y dx = 2m n 

- if; hi* 

where n is an integer. He thus obtained the Sommerfeld quantum condi¬ 
tion 



dx = nh 


However, as Kramers and his collaborators 320 showed, a more exact treat¬ 
ment leads to the result 


y p dx = (n + \)h (5.100) 

These results establish the connection between Bohr’s theory of the hy¬ 
drogen atom of 1913 or its elaborations by Sommerfeld and Schrodinger’s 
wave mechanics. The question why these quantum conditions in spite of 
their being derived from an approximation method give nevertheless the 
correct energy values for several important cases, such as the hydrogen 
atom or the linear oscillator, is a purely mathematical problem whose 
discussion would lead us too far astray. 

In the paper 321 in which Wentzel proposed the new approximation 
method he had already applied it to the Stark effect of hydrogen. As far as 
the linear effect was concerned, his result, like that of Schrodinger, was 
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identical with Epstein’s formula 322 obtained on the older quantum theory. 
For the quadratic effect, however, the term containing the square of the 
field strength F 2 turned out to be 323 

*‘3 6 « 4 [17« 2 - 3(«, - « 2 ) 2 - 9(n 3 - l ) 2 + 19] (5.101) 

2 1t m e 

Wentzel’s result (5.101) was also obtained at the same time and indepen¬ 
dently by I. Waller. 324 It differed from Epstein’s expression by the pres¬ 
ence of + 19 and by the appearance of 9 (n 3 — 1 ) 2 instead of Epstein’s 
9n 2 . 

A deviation from Epstein’s result of 1916 concerning the quadratic 
effect was detected as early as 1921 by Sommerfeld 325 in observations 
made by Takamine and Kokubu 326 in 1919 at the Mount Wilson Observa¬ 
tory. Their plates—the only material on record on the quadratic effect 
until 1925—exhibited a one-sided shift of the whole splitting pattern to¬ 
ward the red, but, as Sommerfeld noticed, not in agreement with Epstein’s 
conclusions. The shift of the middle component of H y > corresponding to 
the transition ( n l = 1 , n 2 = 1 , n 3 = 3) -► (0,0,2) and (2,2,1)-► (0,0,2), 
was carefully examined in 1925 by Kiuti, 327 who found for a field of 
140,000 volts/cm a shift of AA = 0.6 A in contrast to Epstein’s theoretical 
AA = 0.5 A. As Wentzel 328 pointed out, Kiuti’s observed value agrees 
perfectly well with the wave-mechanical result. Later investigations by 
Rausch von Traubenberg and Gebauer, 329 who applied fields up to 
400,000 volts/cm, confirmed this agreement. In fact, von Traubenberg 
and Gebauer declared at the conclusion of their experiments that “Schro- 
dinger’s theory is fully confirmed by our measurements of the red shift as 
exhibited also by the other components of H y .” 330 

Just as Mulliken’s 331 observation of the shift in the band spectrum of 
boron monoxide was the earliest evidence for the superiority of Heisen¬ 
berg’s matrix mechanics over the older quantum theory, so was Kiuti’s 
observation of the shift in the line pattern of the hydrogen Stark effect the 
earliest proof of the superiority of Schrodinger’s wave mechanics over the 
older theory. Both formalisms found support in evidence gathered prior to 
their establishment. 
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CHAPTER 6 

Statistical Transformation 

Theory 


6.1 The Introduction of Probabilistic 
Interpretations 

Within a short time after the publication of Schrodinger’s paper wave 
mechanics was successfully applied to a great number of energy-eigenval¬ 
ue problems. It also soon became clear that the theory could be extended 
to cope with types of problems not initially envisaged by its originator. 
Thus, for example, Fock 1 showed as early as June, 1926, that Schro¬ 
dinger’s equation can be generalized to apply also to problems in which 
the Lagrangian function contains linear velocity-dependent terms. Schro¬ 
dinger’s theory of radiation, based on his time-dependent perturbation 
theory, made it possible to derive Bohr’s frequency condition, which in 
the older quantum theory had the status of a separate postulate, to calcu¬ 
late the intensities and polarizations of emitted radiations and their selec¬ 
tion rules. It provided a logically consistent and mathematically conven¬ 
ient method to solve a great variety of problems, and there could be no 
doubt that it was a major advance over the older quantum theory. In 
addition, with the exception of a few cases such as the treatment of the 
angular momentum it proved to be also more practicable than matrix 
mechanics. 

Schrodinger’s physical explanation of his formalism and, in particular, 
his interpretation of the wave function did not fare so well. In order to 
account for the fact that a mechanical system can emit electromagnetic 
waves of a frequency equal to the term difference and to deduce their 
intensity and polarization, Schrodinger ascribed to ^ an electromagnetic 
meaning, defining it in the early sections of his paper 2 as a continuous 
distribution of electricity in actual space. At the end of the fourth commu¬ 
nication, where these ideas are further elaborated, he defined i/ji/j* as the 
“weight function” of this charge distribution so that/? = is the elec¬ 
tron charge density where e is the charge of the electron. Having derived 
from his time-dependent wave equation in the well-known manner the 
continuity equation 
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-P- + div S' = 0 (6.1) 

dt 

connecting the current density 

S = — tftVip*) (6.2) 

4mm 

with the scalar charge density p , he regarded Eq. (6.1) as a convincing 
confirmation of his electrodynamic interpretation. To explain the empiri¬ 
cal fact that the charge of an electron is ordinarily found to be concentrat¬ 
ed in a very small region of space, Schrodinger combined his electrody¬ 
namic interpretation of the wave function with the idea that the particles 
of corpuscular physics are essentially only wave groups composed of nu¬ 
merous, strictly speaking, infinitely many, wave functions. 

In a paper 3 published in July, 1926, on the relation between microphy¬ 
sics and macrophysics he illustrated his ideas by showing that the phe¬ 
nomenological behavior of the linear harmonic oscillator can be explained 
in terms of the microphysical conception of wave packets. From the nor¬ 
malized eigenfunctions (2" n\)~ l/2 tp n , where 

xf> n = exp( — \x 2 )H„ (x)exp(2 mv„t) and v„ = (n + i)v 0 

oo 

he constructed the wave packet ^ = ^(A /2) n ip n /n\ f where A is a con¬ 
stant large compared with unity, 4 and showed that the real part of xf> is 
given by the expression 


exp 


A 2 1 

- (x — A COS 2 7TV n t) 2 

L 4 2 


Xcos 


w 0 , + Msin2^ 0 o(,-fco S 2,rv 0 ,)] (6.3) 


Pointing out that the first factor in (6.3) represents a narrow hump of the 
shape of a Gaussian error curve, which, at a given moment t , is located in 
the neighborhood of x = A cos 27rv 0 t f and that the wave group as a whole 
does not spread out in space in the course of time, he concluded that the 
particle picture of the oscillator can be consistently interpreted as result¬ 
ing from the existence of such a wave packet. “There seems to be no 
doubt,” he continued, “that we can assume that similar wave packets can 
be constructed which orbit along higher-quantum-number Kepler ellipses 
and are the wave-mechanical-representation of the hydrogen atom.” 

It was soon recognized, however, that Schrodinger’s assumption was 
erroneous and that wave packets, in general, do spread out in space. In 
fact, it was shown by Heisenberg 5 that the oscillator is in this respect an 
exception, the reason being that its successive energy levels happen to 
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differ by equal amounts. Another difficulty was the fact that for poly- 
electronic systems—strictly speaking, also for the one-electron system if 
the motion of the nucleus is taken into consideration—^ is a function in a 
hypothetical multidimensional space and as such can hardly be regarded 
as an efficient cause of radiation phenomena as Schrodinger proposed. 
Other objections to Schrodinger’s interpretation were provided by the 
previously discussed electron-diffraction experiments by crystals and by 
collision experiments, for it was difficult to understand how in such pro¬ 
cesses of wave dispersion the stability of a particle could be preserved if it 
really was but a group of waves. 

As a matter of fact, it was precisely in connection with the quantum- 
mechanical study of such scattering processes that a new interpretation of 
the wave function was proposed. Bom, who in 1954 was awarded the 
Nobel Prize “for his fundamental work in quantum mechanics and espe¬ 
cially for his statistical interpretation of the wave function,” as the official 
decision of the Royal Swedish Academy of November 3, 1954, read, ex¬ 
plained the motives of his opposition to Schrodinger’s interpretation as 
follows: “On this point I could not follow him. This was connected with 
the fact that my Institute and that of James Franck were housed in the 
same building of the Gottingen University. Every experiment by Franck 
and his assistants on electron collisions (of the first and second kind) 
appeared to me as a new proof of the corpuscular nature of the electron.” 6 

Searching for a quantum-mechanical explanation of the process of col¬ 
lision between a free particle, such as an a particle or an electron, and an 
atom, Born adopted the formalism of wave mechanics, stating that 
“among the various forms of the theory only Schrodinger’s formalism 
proved itself appropriate for this purpose; for this reason I am inclined to 
regard it as the most profound formulation of the quantum laws.” 7 

In the paper from which the preceding quotation has been taken Born 
gave a preliminary report of his quantum-mechanical treatment of colli¬ 
sion processes. A more detailed discussion is found in two subsequent 
articles 8 in which he developed what became known as the “Born approxi¬ 
mation.” Born’s method was essentially an application of the perturbation 
theory to the scattering of plane waves, the initial and final wave functions 
being both approximately plane waves far from the scattering center. To 
the system of an electron of energy E = h 2 /2mA 2 , coming from the + z 
direction and approaching an atom whose unperturbed eigenfunctions are 
i/P n (q), he ascribed the eigenfunction rf> 0 nE (q> z ) = tfP n (q) sin (2ttz/A). 
Taking V(x,y,z,q) as the potential energy of interaction between the elec¬ 
tron and the atom and applying the theory of perturbations, he obtained 
for the scattered wave at great distance from the scattering center the 
expression 
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(x,y,z,q) 


= 2 f { d^fim(aJ3,7)sink l n ^(,ax + l3y + yz + S)i/f > m (q) (6.4) 

m J J 

where dco is an element of the solid angle in the direction of the unit vector 
whose components are a, /?, and y , and where (a,/3,y) is a wave 
function which determines what was subsequently called the differential 
cross section for the direction ( a,/3,y ). If this formula, said Born, allows 


for a corpuscular interpretation, there is only one possibility: or 

rather 9 \ip ( „n\ 2 , measures the probability that the electron which ap¬ 
proached the scattering center in the direction of the z axis is found scat¬ 
tered in the direction defined by a , /?, y . 10 Hence, Born continued, wave 
mechanics does not answer the question: What, precisely, is the state after 
the collision? but rather the question: What is the possibility of a definite 
state after the collision? In his more detailed paper 11 on collisions he 
summarized this situation by the often quoted statement: “The motion of 
particles conforms to the laws of probability, but the probability itself is 
propagated in accordance with the law of causality.” 12 

Born’s probabilistic interpretation of the wave function had a three¬ 
fold root. First of all, as explained, impressed by the corpuscular aspects of 
collision experiments, Born Rejected the undulatory interpretation and 
tried, instead, to associate the wave function with the presence of parti¬ 
cles. In his choice how to carry out this association he was influenced, as 
he repeatedly admitted, 13 primarily by Einstein’s conception of the rela¬ 
tion between the field of electromagnetic waves and light quanta. For 
Einstein, Born asserted, the electromagnetic wave field was a kind of 
“phantom field” (“Gespensterfeld”) whose waves serve to guide the cor¬ 
puscular light quanta on their path in the sense that the squared wave 
amplitudes (intensities) determine the probability of the presence of light 
quanta or, in a statistically equivalent sense, their density. Since the wave 
function for an ordinary plane light wave of frequency v = E/h and wave¬ 
length A = h //>, 

( x\ 1 [2777 

v 7yJ =exp \~h 


u(x,t) = exp 


2777V 


(Et — px) 


is identical with the de Broglie wave function of a particle of energy E and 
momentum p , i.e., the eigenfunction of the Schrodinger wave equation 

£± + ^ l e =o 


dx 


h 


where E = p 2 /2m f Born found it natural to carry over Einstein’s concep¬ 
tion of the “phantom field” to particles other than light quanta. Just as the 
intensity of light waves was a measure of the density of light quanta, Born 
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argued, “ it was almost self-understood to regard \xjj\ 2 as the probability 
density of particles.” 14 

It soon became clear, however, that the notion of probability in Bom’s 
interpretation differed from the classical notion as used, for example, in 
statistical mechanics or in the kinetic theory of gases. For, if a wave field 
tfj i with probability density P x = \x(j x \ 2 is superposed with a wave field xp 2 
with probability density P 2 = |^ 2 | 2 > the probability density of the super¬ 
posed field ^ + $2 is not, as the classical notion of probability would 

imply, P x + P 2 , but rather \i// l + r[r 2 \ 2 = P x + P 2 + + xftfi(r u where 

the last two “interference terms” are generally different from zero. 


For Einstein the notion of probability, even as he applied it to reconcile 
his light-quanta hypothesis with Maxwell’s theory of electromagnetic 
waves, was the traditional conception of classical physics, a mathematical 
objectivization of the human deficiency of complete or exact knowledge 
but ultimately a creation of the human mind or, as Spinoza expressed it, “a 
sola imaginatione ... quod res ... ut contingentes contemplemur.” 15 
But Spinoza’s statement: “De natura rationis non est, res, ut contingentes, 
sed, ut necessarias, contemplari,” 16 which dominated scientific thought 
for centuries with only a few exceptions, such as those referred to in Sec. 
4.2, lost its validity for Born and his school. For Born probability, as far as 
it was related to the wave function, was not merely a mathematical fiction 
but something endowed with physical reality, for it evolved in time and 
propagated in space in accordance with Schrodinger’s equation. It dif¬ 
fered, however, from ordinary physical agents in one fundamental aspect: 
it did not transmit energy or momentum. Since in classical physics, 
whether Newtonian mechanics or Maxwellian electrodynamics, only 
what transfers energy or momentum (or both) is regarded as physically 
“real,” the ontological status of had to be considered as something 
intermediate. It had—and that brings us to the third root of Born’s proba¬ 
bilistic interpretation—precisely that “intermediate kind of reality” 
which, as Heisenberg 17 had emphasized, transpired in the work of Bohr, 
Kramers, and Slater in 1924. Now it will be understood that Born’s inter¬ 
pretation was indeed influenced by the Bohr-Kramers-Slater conception 
of the virtual radiation field. Recalling also their treatment of induced 
emission, etc., we can understand why Heisenberg declared: “Bom estab¬ 
lished in the summer of 1926 his theory of collision processes and inter¬ 
preted correctly the wave in multidimensional configuration space as a 
probability wave by developing and elaborating an idea previously ex¬ 
pressed by Bohr, Kramers, and Slater.” 18 


Laws of nature, as Born and Heisenberg contended from now on, de¬ 
termined not the occurrence of an event, but the probability of the occur¬ 
rence. For Heisenberg, 19 as he later explained, such probability waves are 
“a quantitative formulation of the concept of Suva/iis , possibility, or in 
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the latter Latin version, potentia , in Aristotle’s philosophy. The concept 
that events are not determined in a peremptory manner, but that the 
possibility or ‘tendency’ for an event to take place has a kind of reality—a 
certain intermediate layer of reality, halfway between the massive reality 
of matter and the intellectual reality of the idea or the image—this concept 
plays a decisive role in Aristotle’s philosophy. In modern quantum theory 
this concept takes a new form; it is formulated quantitatively as probabili¬ 
ty and subjected to mathematically expressible laws of nature.” 20 

Having interpreted i/> as a probability wave in the sense just explained 
but realized that xp can be expanded in terms of a complete orthonormal 
set of eigenfunctions xjj n of the Schrodinger equation [H — W,r/j] =0, 


(6 - 5) 

n 

where in accordance with the completeness relation 



\t/>(q)\ 2 dq='£ 

n 



( 6 . 6 ) 


Born had to ask himself what meaning to ascribe to the c n . The fact that 
for a single normalized eigenfunction tfr(q), corresponding to a single 
particle, the right-hand side of Eq. (6.6) is unity suggested to Born 21 that 
the integral S\ip(q) \ 2 dq has to be regarded as the number of particles and 
\c n | 2 as the statistical frequency of the occurrence of the state character¬ 
ized by the index n. To justify this assumption Born calculated essentially 
what was later called the expectation value of the energy W for if; and 
obtained 


W=^\c n \ 2 W n (6.7) 

n 

where W n is the energy eigenvalue of xfj n . 

Born’s probabilistic interpretation scored immediate success in the 
field where it originated and where it could be applied most naturally, in 
the problems of atomic scattering. Applying Born’s approximation meth¬ 
od to the scattering of electrically charged particles by a charged scatter¬ 
ing center, Wentzel 22 derived Rutherford’s classic scattering formula 23 on 
the basis of wave mechanics. Faxen and Holtsmark 24 and later Bethe 25 
and Mott 26 used Born’s method for the study of the passage of slow and 
fast particles through matter in the course of which the Ramsauer-Town- 
send effect 27 and Dymond’s observations 28 found a satisfactory wave- 
mechanical explanation. 

Bom himself, 29 and later in collaboration with Fock, 30 attempted to 
put the probabilistic interpretation on logically stronger foundations by 
clarifying its relation to classical physics. In August, 1926, at the Oxford 
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meeting of the British Association for the Advancement of Science, 
Born 31 had declared: “We free forces of their classical duty of determining 
directly the motion of particles and allow them instead to determine the 
probability of states. Whereas before it was our purpose to make these two 
definitions of force equivalent, this problem has now no longer, strictly 
speaking, any sense. The only question is why the classical definition is so 
successful for a large class of phenomena. As always in such cases, the 
answer is: Because the classical theory is a limiting case of the new one. 
Actually, it is usually the ‘adiabatic’ case with which we have to do: i.e., 
the limiting case where the external force (or the reaction of the parts of 
the system on each other) acts very slowly. In this case, to a very high 
approximation c\ = 1, c\ = 0, c\ =0,..., that is, there is no probability for 
a transition, and the system is in the initial state again after the cessation of 
the perturbation.” 

In the mathematical elaboration 32 of these ideas Born considered the 
general solution 


= ^c n tf>„ (x) exp W n t^j 

of the time-dependent Schrodinger equation 

4mm dip n 
Aif, - U(x)t/; ---^ = 0 


( 6 . 8 ) 


(6.9) 


where the xfj n (x) are assumed as normalized. In contrast to Schrodinger, 
who interpreted Eq. (6.8), in analogy to acoustics, as asserting the exis¬ 
tence of simultaneous eigenvibrations in one and the same atom, Bom, 
adhering to Bohr’s conception that at a given time an atomic system can 
assume only one stationary state, interpreted, in accordance with the con¬ 
clusions of his study of scattering, 


c 




as the probability that at the time t the atom is in state n. Studying the 
effect of an external perturbation on the system, Born compared the ap¬ 
proach of classical mechanics with that of quantum mechanics as follows. 
The basic problem in classical mechanics, he declared, is this: at time t = 0 
the configuration (positions and velocities) of the system is given; what is 
the configuration at time t=T if the system has been acted upon by a 
given force during the interval A t from f = 0 to f = T1 The basic problem 
of quantum mechanics, on the other hand, is in his view: until t = 0 the 
probabilities ( \c n | 2 ) of a configuration are given; what is the probability 
of a given configuration after the time T, if the system has been acted upon 
by a given external force during (and only during) the interval A t ? Born 



306 


Conceptual Development of Quantum Mechanics 


emphasized that quantum mechanics considers not the individual pro¬ 
cess, the “quantum transition” or “quantum jump,” as causally deter¬ 
mined, but rather the a priori probability of its occurrence; this probability 
is ascertainable by an integration of Schrodinger’s differential equation. 
And Bom added, obviously in the spirit of Bohr’s philosophy: “Whatever 
occurs during the transition can hardly be described within the concep¬ 
tual framework of Bohr’s theory, nay, probably in no language which 
lends itself to visualizability.” 33 

Applying Schrodinger’s time-dependent perturbation theory, Born 
solved the basic problem of quantum mechanics as follows. At time t = 0 
the system is given by 


ip(x,0) =^c„ip n (x) ( 6 . 10 ) 

n 

as shown by Eq. (6.8). Its evolution is given by Eq. (6.9) with U(x) 
replaced by U(x) + F ( x,t ), where F ( x,t ) vanishes for t <0 and t> T. 
Considering the special case ^(x,0) = (x), Born showed that the inte¬ 

gration of the differential equation yields for t > T 

= ^b nm ip m (x) ex P0p (6.11) 

where the coefficients b nm are uniquely determined by the total behavior 
of F (x,0 during A t. Born now declared that \b nm | 2 is the probability that 
the system, which for t <0 was in state n [i.e., (x)], isfor/> Tin state 

m [i.e., x^ m (x)]. In other words, \b nm \ 2 is the “transition probability” 
(“Ubergangswahrscheinlichkeit”). Considering, now, the general case 

where the initial state was given by tf/(x, 0) =^c n t// n (x), and where, 

n 

consequently, for t > T 


i/>(x,t) = £ c n ip„ ( x,t ) ( 6 . 12 ) 

n 

with the (x,0 given by Eq. (6.11), Bom calculated the general transi¬ 
tion probability |C„| 2 —corresponding to the representation 

tfr(x,t) =^C n tf/ n (x) for t > T —and obtained by making use of the orth- 

n 

onormality of the x/j n (x) 

c n = J i/>(x,Dffi(x)dx = £ b mnC m exp W„T^j 

so that 


C n 

2 _ 

^ Cm &mn 



m 


(6.13) 
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From Eq. (6.13) Bom concluded that the quantum transitions be¬ 
tween a state m and a state n cannot be regarded as independent events in 
the sense of the classical theory of probability; for, were this the case, the 
right-hand side of Eq. (6.13) should read, in accordance with the theorem 
of composite probabilities, 


XI C m| 2 |^mJ 2 (6.14) 


which generally differs, of course, entirely from 


^ Cm bmn 


2 


Thus we see that in the course of proving the wave-mechanical equiva¬ 
lent of the adiabatic theorem, namely, that for infinitely slow perturba¬ 
tions the transition probabilities are zero, Bom advanced two theorems 
which were destined to play a fundamental role in the further develop¬ 
ment of quantum theory, its interpretation, and its theory of measure¬ 
ment: (1) the theorem of spectral decomposition according to which 
there corresponds a possible state of motion to every component xjj n in the 
expansion or superposition of i/s; (2) the theorem of the interference of 
probabilities according to which the phases of the expansion coefficients, 
and not only their absolute values, are physically significant. 

These results were independently obtained at the same time also by 
Dirac 34 and incorporated into the construction of the transformation the¬ 
ory of quantum mechanics, a generalization of the matrix and wave me¬ 
chanics as developed so far. 

Schrodinger’s electrodynamic and Bom’s probabilistic conceptions 
were not the only interpretations proposed at that time. At least two other 
suggestions have to be mentioned which are important not only for histor¬ 
ical reasons: they still reverberate in current discussions on the founda¬ 
tions of quantum mechanics. 

Realizing that for ^ = ae lP with time-dependent a and /? Schro¬ 
dinger’s time-dependent wave equation implies 


div (a 2 grad<p) + 



(6.15) 


where <p= — [3h /lirm, Madelung called attention to the fact that this 
equation (6.15) has the form of the continuity equation in hydrodynamics 
if a 2 is taken as the density and q) as the velocity potential of a fluid in 
motion. Elaborating these ideas, Madelung 35 showed that every eigenso- 
lution of the wave equation, although a function of time, can be interpret¬ 
ed as a stationary flow pattern. As the hydrodynamical model represented 
also other essential features of Schrodinger theory, Madelung suggested 
that “there is a chance to treat the quantum theory of atoms on this basis.” 
He had to admit, however, that not all phenomena—and, in particular, 
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those related to absorption processes—could be consistently interpreted 
as yet by such a hydrodynamical model. 

In an alternative interpretation, proposed at about the same time, 
Louis de Broglie combined Born’s probabilistic approach with certain 
ideas which Einstein had tentatively advanced in 1909, when he regarded 
light quanta as singularities 36 of a field of waves. In his first paper on this 
subject de Broglie 37 confined his treatment to light quanta. The solution of 
the wave equation 

A 1 d 2 u 

A « = ——— (6.1o) 

c 2 dt 2 


in classical optics, he argued, is given by a function of the form 

u = a(x 9 y,z) exp {ico[t — <p(x,y,z) ]} (6.17) 

which satisfies the boundary conditions imposed by the presence of 
screens, apertures, or other obstacles encountered by the waves; in “the 
new optics of light quanta,’’ on the other hand, he contended that its 
solution is given by a function of the form 

u =f(x>y f z,t) exp {io)[t — <p(x,y,z)]} (6.18) 

where q? is the same function as before, but / (x,y,z,t) has “mobile singu¬ 
larities’’ (“singularity mobiles’’) along the curves n normal to the phase 
fronts q) = const. “These singularities,’’ he declared, “constitute the 
quanta of radiative energy.’’ Substituting (6.18) in (6.16) and taking only 
the real part, de Broglie obtained 


SU'+l/A* >2. 

dn dn 2 c 2 dt 


(6.19) 


Reasoning that the quotient f /{df /dn ) vanishes at the position M of the 
particle, on grounds only partially explained, and knowing that the veloc¬ 
ity of the quantum of light when passing through M is given by 38 


K= ( df/dt ) 

\df /dn )M,t 

he concluded that 


( 6 . 20 ) 


V= c 1 (^\ ( 6 . 21 ) 

V dn ) m 

so that q >—just as Madelung’s q >—plays the role of a velocity potential. 
Denoting by p the density of the light quanta, de Broglie utilized argu¬ 
ments taken from the mechanics of fluids to show that 


p = consta 2 (6.22) 

or, in other words, that the light-quanta density is proportional to the 
intensity of the radiation. 
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In a second paper 39 de Broglie carried over these considerations for the 
interpretation of the Schrodinger wave function and the motion of parti¬ 
cles. “In micromechanics as in optics/’ he declared, “continuous solu¬ 
tions of the wave equation provide merely statistical information; an exact 
microscopic description undoubtedly necessitates the usage of singularity 
solutions reflecting the discrete structure of matter and radiation.” 40 

In the spring of 1927 de Broglie brought these ideas to maturity and 
presented them in the form of what he called “the theory of double solu¬ 
tion.” 41 According to this theory the linear equations of wave mechanics 
admit two kinds of solution: a continuous ^ function with statistical sig¬ 
nificance and a “singularity solution” whose singularities constitute the 
physical particles under discussion. On the region between the two solu¬ 
tions de Broglie made the following statements: “Particles would then be 
clearly localized in space, as in the classical picture, but they would be 
incorporated in an extended wave phenomenon. For this reason, the mo¬ 
tion of a particle would not obey the laws of Classical Mechanics accord¬ 
ing to which the particle is subject only to the action of forces exerted on it 
in the course of its trajectory, without experiencing any effect from the 
existence of obstacles that may be situated at some distance outside the 
trajectory. In my conception, on the contrary, the motion of the singular¬ 
ity was to be dependent on all the obstacles that hindered the free propaga¬ 
tion of the wave phenomenon surrounding it, and there would result from 
this a reaction of the wave phenomenon on the particle—a reaction ex¬ 
pressed in my theory by the appearance of a ‘quantum potential’ entirely 
different from the potential of ordinary forces. And in this way the appear¬ 
ance of interference and diffraction phenomena would be explained.” 42 


6.2 The Transformation Theory 

Although the quantum-theoretic transformation theory originated in an 
attempt to solve a specific problem of rather limited scope, it soon proved 
itself conducive to such far-reaching generalizations that it led eventually 
to a unifying synthesis of all the diverse approaches and provided through 
its abstract formulation of the basic principles a deeper insight into the 
nature of the theory. A comparison with classical mechanics seems to be 
instructive. Quantum theory prior to the advent of the transformation 
theory may be compared with Newtonian mechanics prior to, say Pois¬ 
son’s introduction of generalized momenta; and just as the development of 
the canonical formalism in classical dynamics provided in the work of 
Jacobi, Poincare, and Appell a profound comprehension of the whole 
structure of classical mechanics, so the development of the quantum-me- 
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chanical theory of transformations culminated with the work of Dirac, 
Jordan, and von Neumann in the achievement of new vistas which made it 
possible to view the nonrelativistic quantum mechanics of a finite number 
of degrees of freedom as a logically consistent, compact, and unified sys¬ 
tem of thought. 

Owing to the expansive trend of this development, a clear-cut or uni¬ 
versally accepted definition of the subject matter of the transformation 
theory is hardly found in literature. To mention only two examples, in the 
1960 edition of Sommerfeld’s Atombau und Spektrallinien 43 the chapter 
on transformation theory deals almost exclusively with the Fourier trans¬ 
formation of dynamical variables whereas in the 1964 edition of Eisele’s 
Advanced Quantum Mechanics and Particle Physics 44 the discussion of the 
transformation theory focuses exclusively on the relation between the 
Schrodinger, Heisenberg, and interaction pictures. To cover such ex¬ 
tremes it is possible to define the transformation theory as the study of 
those transformations in quantum theory which leave the results of em¬ 
pirically significant formulas invariant. The various conceptions of the 
subject matter of the theory then depend on the kind of space with respect 
to which the performance of transformations is to be considered. In fact, 
while the early phases of the development were characterized by transfor¬ 
mations within the configuration and momentum space, its later elabora¬ 
tions led to the conception of abstract Hilbert spaces as the arena underly¬ 
ing the transformations. In this course the formerly important notion of 
canonical transformations in quantum mechanics gradually lost its pecu¬ 
liarity and importance. 

The early development of the transformation theory from its incep¬ 
tion 45 in the spring of 1926 until its elaboration to what is usually called 
the Dirac-Jordan transformation theory is the subject of our present dis¬ 
cussion. 

Matrix mechanics, as developed so far, if viewed against the back¬ 
ground of classical analytic dynamics, was confined to the treatment of 
libration coordinates, in contrast to the older quantum theory which 
worked primarily with angle coordinates. In fact, while for the older 
quantum theory in its search for canonical transformations which reduce 
a problem to cyclic motions the rotator was the basic prototype, for matrix 
mechanics it was the oscillator . 

The specific problem which, as mentioned above, gave rise to the devel¬ 
opment of the transformation theory even before the advent of wave me¬ 
chanics was the question: could the Hamilton-Jacobi method of classical 
dynamics, which, it will be recalled, provided directly the frequencies of 
multiply periodic systems, be carried over into matrix mechanics? An 
important step in this direction was made by Dirac in a paper 46 already 
referred to. In it, incidentally, Dirac introduced a one-letter notation for 
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h /2ir which, slightly modified, 47 will be used from now on also in our 
present text. Dirac showed that the determination of the frequencies for 
multiply periodic systems can be reduced to the problem of finding a 
system of canonical variables J and w which—similar to their classical 
prototypes—satisfy the conditions 

= [w r) u; s ] =0 [ w r ,J s ] =S rs 

where the quantum-mechanical Poisson brackets [x,y] are defined by 

xy —yx = ih [x,y] 

the Hamiltonian H is a function of only the /, and the original p and q 
describing the system are multiply periodic functions of the w of period 
2i t. For the hydrogen atom Dirac was able to calculate the J and to re¬ 
trieve 48 the Balmer formula, but was unable to calculate the intensities. A 
method for calculating the J was subsequently proposed by Wentzel, 49 
who adapted to this end the Sommerfeld method of complex integration 
to the matrix calculus and applied it successfully in the case of the har¬ 
monic oscillator and of the hydrogenic atom. 

The introduction of canonical angle and action variables in quantum 
mechanics was further promoted by Jordan, 50 who gave a rigorous proof 
of a theorem whose validity had already been assumed six months earlier 
in the “three-man paper,” 51 namely, that every canonical transforma¬ 
tion, that is, a transformation which leaves the commutation relations 
invariant, can be written as 

P = SpS~ 1 Q = SqS~\ (6.23) 

Jordan also proved that every point transformation, in classical mechan¬ 
ics always a canonical transformation, is so also in quantum mechanics. 
But in spite of Jordan’s important results the transformation (6.23) 
proved of little practical use, primarily because the calculation of recipro¬ 
cal matrices S ~ 1 was a difficult task. Only in the case of infinitesimal 
canonical transformations, where 


and consequently 

S= 1 + AS, +A 2 S 2 + • • • 

(6.24) 

S ~ 1 

= 1 - /IS, +/l 2 (S? -S 2 ) + • • • 

(6.25) 


could these transformations be carried out to be used in a matrix-pertur¬ 
bation theory for the solution of the principal-axes problem. Matrix me¬ 
chanics thus was capable of dealing with complicated perturbation prob¬ 
lems, once a solution of the unperturbed problem was available, but failed, 
in general, to solve problems which could not be considered as approxima¬ 
tions to solved cases. 
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A serious difficulty for the introduction of angle and action variables in 
matrix mechanics was the fact that the quantum conditions 


ft 

pq-qp = — 


i 


(6.26) 


could not be satisfied for constant p , that is, a diagonal matrix p. In May, 
1926, London 52 attempted to circumvent this difficulty by generalizing 


(6.26) to 


pE(inq) — E(inq)p = nhE(inq) (6.27) 

oo 

where E(q) is defined as the matrix function ^ q 5 /s\. 

5 = 0 

For constant p — ( Jkk ) London obtained from (6.27) 

hE u = ( J k k ~Ju)Eki (6.28) 

or, since for distinct J kk for each index k only one index / exists with 
Ekii 1 ®* 

Jkk ~ Jk-\,k-\ —h (6.29) 

so that in agreement with the older quantum theory 

J kk =kh + const. (6.30) 

On the other hand, by taking lim n -►O, he showed that (6.27) reduces to 
(6.26). Elaborating this approach, London adapted the Hamilton-Jacobi 
method of introducing action and angle variables to matrix mechanics 
and concluding that the principal problem for matrix mechanics is the 
determination of the transformation generator S. 

Meanwhile, however, Schrodinger’s proof of the formal equivalence 
between matrix and wave mechanics had appeared and diverted the inter¬ 
est of physicists from the purely matrix-mechanical methods to the wave- 
mechanical approach. For it was hoped that also these problems could be 
solved with much less effort by the more advanced analytical apparatus of 
wave mechanics. 

In fact, London 53 himself was the first to carry over, in the fall of 1926, 
the matrix-mechanical transformation theory, incomplete as it still was, 
into the conceptual framework of Schrodinger’s wave mechanics, which 
had worked so far only with representations in the configuration space. In 
contrast to Schrodinger, who had used only point transformations 
Q=f(q) for solving the eigenvalue equation 




(6.31) 


London, in analogy to the matrix-mechanical use of canonical transfor¬ 
mations, considered S = S(q,P) as the generator of the transformation 
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c _ dS[q,Wi)(d/dQ)] ft d _ dS [q, (fi/i) (d/dQ) ] 

dP i dq dq 

(6.32) 

which, as Jordan 54 had shown, could be written, with T = T(Q f d /dQ) , in 
the form 


q=T~'QT —=T~ i — T. (6.33) 

dq dQ 

London now showed that the eigenvalues of (6.31) are invariant under 
the transformation (6.33). For, he said, let 




(6.34) 


be the eigenvalue equation for the transformed function xjr y here 55 

H = T~ X HT. (6.35) 


By multiplying both sides of Eq. (6.34) by T, he obtained 

IHT$\ =E[Tj>] (6.36) 

or, as he tacitly assumed 56 as equivalent, 

[H,T}] =E[Tj>]. (6.37) 

But (6.37) is identical with (6.31) for 

tp= [Tj>] (6.38) 

and the invariance of the eigenvalues has been established. 

Expanding the orthonormal eigenfunction 

lM0)= [T$ k {Q)\ (6.39) 

in terms of the orthonormal eigenfunctions tp k (Q), 


(M0=X^,.(0 (6.40) 

i 

London obtained in the usual manner 

jW k dQ = 8j k =2TTjT lk (6.41) 

or 57 

T*T= 1. (6.42) 

Having thus shown that T satisfies the “Hermitian orthogonality condi¬ 
tion,” that is, in modern terminology, that 7”is a unitary matrix, London 
interpreted (6.40) as a rotation, in the infinite-dimensional Hilbert space, 
of the coordinate system spanned by the orthogonal eigenfunctions. He 
also showed that the matrix elements F kl , such as those of the electric 
dipole moment, are invariant under such rotations. For, he contended, if 
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F kl = J r.Ftp,) dQ (6.43) 

F kl = X f (T lk j,,)*TFT- l (T j rf j )dQ 

ij J 

= ( T*TFT~ l D k i =F kl . (6.44) 

Hence, London concluded, under rotations in Hilbert space intensities are 
rotational invariants whereas energy eigenvalues are affine invariants. 

It was only after the completion of his paper that London fully appreci¬ 
ated the intimate connection between his new conceptions and the theory 
of linear operators in functional spaces or, as it was called at that time, the 
“theory of distributive functional operations.” In a footnote added to the 
galleys, London referred to a paper 58 by Cazzaniga, to a memoir 59 by 
Pincherle, and to the latter’s resume article 60 in the Encyklopadie der 
mathematischen Wissenschaften as presentations of the abstract math¬ 
ematical theory that underlies his new interpretation. When London add¬ 
ed this footnote to page 199 of his paper, he could hardly have been aware 
of its historical importance. It was the first reference to the future lan¬ 
guage of theoretical physics. For just as the differential and integral calcu¬ 
lus was the language of classical dynamics or the tensor calculus that of 
relativity, the medium of expression for modem quantum mechanics 
turned out to be the theory of linear spaces and, in particular, of Hilbert 
spaces, or, more generally, the theory of functional analysis. In view of 
those circumstances a brief digression on the development of functional 
analysis prior to London’s paper seems to be appropriate. 

The earliest conception of a comprehensive theory of linear spaces was 
probably Grassmann’s Ausdehnungslehre, 6i whose original version, 62 
first published in 1844, was newly formulated and greatly amplified in 
1862. In his ambitious attempt to formulate a theory of abstract linear n- 
dimensional spaces, Grassmann introduced, defined, and employed cor¬ 
rectly such fundamental notions as linear independence, bases, inner 
products, orthogonality, normalization, as well as certain forms of tensor- 
ial quantities and linear operators. The importance of these innovations 
for geometry and algebra alike was not recognized until Schlegel 63 drew 
the attention of mathematicians to these ideas. The first to suggest an 
extension of Grassmann’s approach to spaces of infinitely many dimen¬ 
sions was Peano, whose statement “un sistema lineare puo anche avere 
infinite dimensioni’’ 64 initiated the theory of linear infinite-dimensional 
spaces. By combining the algebraization of geometrical concepts with the 
operator calculus as developed during the early second half of the last 
century primarily by British mathematicians, 65 mathematicians like Pin¬ 
cherle, Cazzaniga, Calo, Carvallo, Amaldi, and Volterra laid the founda- 
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tions of functional analysis. For it was soon recognized that certain fam¬ 
ilies of functions display with respect to their algebraic and metric 
structures properties of the same character as vectors in the finite-dimen¬ 
sional Euclidean spaces of analytical geometry or their newly conceived 
generalizations to infinitely many dimensions. It was in this connection 
that Pincherle and Cazzaniga, in the articles referred to by London, de¬ 
fined functional spaces as sets of power series and declared: “Every power 
series is an element or point of such a space and the set of coefficients in the 
series may be regarded as the set of its coordinates.” 66 Volterra seems to 
have been the first to apply the geometry of infinite-dimensional spaces to 
problems of classical analysis. He was soon followed by Moore, who an¬ 
ticipated a number of important features of modem functional analysis in 
his so-called “general analysis.” 67 Frequently encountered similarities 
among results obtained in apparently completely disparate branches of 
mathematics, such as the calculus of variations, the theory of differential 
and integral equations, problems of oscillations in continuous media, ap¬ 
proximations, etc., suggested the establishment of a conceptual unifica¬ 
tion, or as Moore phrased it: “The existence of analogies between central 
features of various theories implies the existence of a general theory which 
underlies the particular theories and unifies them with respect to those 
central features.” 68 

Although intended originally to serve as a maxim of research in pure 
mathematics, this “general heuristic principle of scientific procedure,” as 
Moore 69 once called it, proved valid also with respect to the various for¬ 
malisms in quantum physics. In fact, just as functional analysis and its 
theories of operators in linear spaces both developed from, and led to, a 
unified point of view with respect to multifarious facts and approaches in 
mathematics, so also the functional-analytical formalism of quantum me¬ 
chanics led, as we shall see in due course, to a logical unification of all 
quantum-theoretic approaches. 

With the advent of the modern point-set topology, as founded by 
Maurice Frechet and Felix Hausdorff, which made it possible to apply 
topological notions to algebra and analysis, and with the development of 
the theory of integral equations through the work of Erik Ivan Fredholm, 
David Hilbert, Erhand Schmidt, and Frederic Riesz, functional analysis 
became one of the most important branches of modern mathematics and, 
as we shall see later on, the mathematical foundation of quantum mechan¬ 
ics. 

Returning now to the paper by London, we note that it started by 
applying canonical transformations to the wave mechanics of discrete 
eigenvalue problems and ended up with discrete transformation matrices. 
A few weeks later Dirac published a paper 70 which began by applying 
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canonical transformations to continuous or discrete matrices in Dirac’s 
matrix mechanics and ended up with the wave mechanics of continuous or 
discrete eigenvalue problems. Dirac’s work thus complemented London’s 
in two respects: it showed, so to speak, the reversibility of the conceptual 
process under discussion and generalized it to continuous transforma¬ 
tions. 

Reviewing this crucial phase of his work—for Dirac “the growth of the 
use of transformation theory ... is the essence of the new method in theo¬ 
retical physics,” as he stated in the preface to the first edition of his influ¬ 
ential textbook The Principles of Quantum Mechanics 71 —Dirac com¬ 
mented: “A great deal of my work was just playing with equations and 
seeing what they give.” 72 More specifically, he declared: “After people 
had established the equivalence between the matrix and the wave theories, 
I just studied their work and worked on it, and tried to improve it in a way 
that I had done several times previously. I think the transformation theory 
came out from that.” 73 When studying the work previously done on this 
subject, Dirac was particularly impressed 74 by Lanczos’ paper 75 of 1926 
which, as we have seen, reformulated matrix mechanics as a continuum 
theory in terms of integral equations. Although Dirac, with his character¬ 
istic modesty, declared 76 that his transformation theory should be regard¬ 
ed merely as a generalization of Lanczos’ paper, he certainly gave his 
theory a new and independent logical foundation by boldly introducing 
one of the most useful mathematical devices in quantum theory: the fam¬ 
ous 5-function or “Dirac function” as it was subsequently called. His early 
interest in algebraic operators, 77 his study of Heaviside’s operator calculus 
in electromagnetic theory, his training as an electrical engineer, and his 
familiarity with the foundations of the modem theory of electric pulses all 
contributed to this effect. In fact, Dirac declared: “All electrical engineers 
are familiar with the idea of a pulse, and the 5-function is just a way of 
expressing a pulse mathematically.” 78 

It would be wrong, however, to assume that the earliest appearance of 
the 5-function was in connection with electrical pulses. It was rather in 
connection with the use of Green’s theorem in the study of Huygen’s 
principle that Gustav Kirchhoff 79 defined the 5-function, which he de¬ 
noted by F (f), for the first time, in 1882, as follows: “As to the function F 
we assume that it vanishes for all finite positive and negative values of its 
argument, but that it is positive for such values when infinitely small and 
in such a way that 

[F{£)d£= 1 

where the integration extends from a finite negative to a finite positive 
limit.” Kirchhoff justified his assumption with the remark that 
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~~ exp( — fi 2 g 2 ) 

ylTT 

approximates F for very large //, and he emphasized that “F (f) together 
with its derivatives is for all values of f, zero included, a finite and contin¬ 
uous function of its argument.” 

More than ten years later Heaviside 80 introduced the 5-function or its 
equivalent into the electromagnetic theory; prior to its use in quantum 
mechanics it was also employed in statistical mechanics by Paul Hertz 81 in 
connection with his statistical conception of temperature. That the S- 
function is only a convenient mathematical artifice was clearly recognized 
by Dirac when, in contrast to KirchhofF, he said: “Strictly, of course, <5(x) 
is not a proper function of x, but can be regarded only as a limit of a certain 
sequence of functions. All the same one can use S(x) as though it were a 
proper function for practically all the purposes of quantum mechanics 
without getting incorrect results. One can also use the differential coeffi¬ 
cients of 5(x), namely, 5 '(jc),5"(jc)..., which are even more discontin¬ 
uous and less ‘proper’ than 5(x) itself.” 82 

Thus defining <5 (x), as usually, by the conditions: <5 (x) = 0 for all x ^ 0 
and S dxS(x) = 1, and showing that 

J oo 

f(x)S in) (a — x)dx = f {n) (a) (6.45) 

— oo 

Dirac applied it in order to express the elements of a continuous unit 
matrix 1 (a'a"), labeled by the continuous parameters or indices a ' and 

a", 

1 (a'a") =fi(a'-a*) 

or to express the elments of a general continuous diagonal matrix, 

f(a'a") =f(a')8(a' — a"). (6.46) 

Considering now the canonical transformation of a dynamical variable 
g into G , 

G = bgb ~ 1 (6.47) 

Dirac expressed (6.47) in terms of continuous matrices as follows: 


g(i’S') = f f (£'/a')da'g(a'a'')da"(a"/i"). (6.48) 


In (6.48) the primed or multiply primed continuous parameters (c- 
numbers) number the rows and columns of the matrix elements, (£ Va ’), 
( a " /£") denote the “transformation functions” b(g'a') f b~ l ( a! £ "), re¬ 
spectively, and unprimed letters—with the exception of the d f of course— 
denote dynamical variables (^-numbers). For the equivalent relations 
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Gb = bg (6.49) 

b~ l G = gb-' (6.50) 

Dirac wrote 

(£"/«') 

= (£'/a")da"g(a"a’)=g(£'a') (6.51) 

(a'/g")dg"g(g"£') 

= g{a'a")da" {a"/£,') =g(a'£') (6.52) 

Here the expressions g(g'a') and g(a'g') “may be regarded as the ele¬ 
ments of two matrices that represent the dynamical variable g according 
to two new more general schemes, in which the rows and the columns of 
the matrices refer to different things,” and the matrix elements of both G 
and g are denoted by g(g 'g ") andg(a"a'), respectively, since the G is the 
same function of the transformed variables as g was of the untrans¬ 
formed . 83 

Dirac knew that in a matrix scheme in which the possible values (c- 
numbers) of the dynamical variable £ serve to number rows and columns, 
the matrix representing £ is diagonal or 

But, he asked, what, in this same matrix scheme, is the matrix that repre¬ 
sents a dynamical variable 77 which is canonically conjugate to £, that is, 
for which 

iV ~ Vi = & (6.54) 

holds? He found that 

v(£’£”) = -««'(£'-£") (6.55) 

and showed, using integration by parts, that indeed 

Hv ~ V§) (6.56) 

Considering the transformation between two matrix schemes and apply¬ 
ing (6.51) to £ and 77 , 

i(i'a') = \i(i'£")di'(£ "/a') (6.57) 

rj(g'a') = f 7 i(£'£")d£ "{§'/(*') 


(6.58) 
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Dirac deduced from (6.53) that 


£{£'a')=i'(£'/a') 


and from (6.56), using integration by parts, that 


(6.59) 


ir(i'a') = -ifi[ 8'(£'-£")d£"(£"/a') 


= - it d (£’ /a '\ (6 60 ) 

d£' 

Thus he found that quite generally for any function / (£, 77 ) of the canoni¬ 
cally conjugate dynamical variables £ and 77 


f{£, V ){£'a') =/(£', ~ m jp) <£'/«')• < 6 ' 61) 

Formula (6.61), Dirac pointed out, makes it possible to obtain the matrix 
scheme (a) for which a given function f (£,rj) becomes a diagonal matrix 
so that, in accord with (6.46) 


/(«'«") =f(a')8(a' — a"). 

For from Eqs. (6.61) and (6.51) it follows that 

^' /a "> =/(£?7)(r«') 

= J (£'/a")da" f(a"a') =f(a')(g'/a’) (6.62) 


which is an ordinary differential equation for (£'/a'), considered as a 
function of g ' and parameterized by a'. In particular, if g and rj are identi¬ 
fied with the position and momentum coordinates q and p , respectively, 
and/ (£, 77 ) and the Hamiltonian H of the system, and if (£'/a') is writ¬ 
ten 84 2 LSxjj E (q) with /(a') = E, Eq. (6.62) turns out to be 


= Eil> E { q ) (6.63) 

which is precisely Schrodinger’s time-independent wave equation. Dirac 
thus could declare: “The eigenfunctions of Schrodinger’s wave equation 
are just the transformation functions (or the elements of the transforma¬ 
tion matrix previously denoted by b) that enable one to transform from 
the ( q ) scheme of matrix representation to a scheme in which the Hamil¬ 
tonian is a diagonal matrix .’’ 85 The proof as presented has been outlined, 
for the sake of simplicity, for the simplest conceivable case but can obvi¬ 
ously be generalized, as Dirac pointed out, to the general case of a number 
of degrees of freedom and for eigenvalue spectra which contain both con¬ 
tinuous ranges and discrete sets of values. 
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Having realized that the transformation function (£ '/a') is a general¬ 
ization of Schrodinger’s wave function x/j just as the differential equation 
(6.62) is a generalization of Schrodinger’s wave equation (6.63), Dirac 
extended Born’s statistical interpretation of xjj to the more general (£ '/a'). 
In Bom’s wave-mechanical treatment of time-dependent perturbations, it 
will be recalled, the wave function xf/(q,t) of the perturbed system, at time 
t , was expanded in terms of the unperturbed eigenfunctions ip n (q), 

=^c n (.t)if> n (q) (6.64) 

n 

and | c n ( t) | 2 was interpreted as the probability that a transition to the state 
characterized by n took place. For a continuous energy range Eq. (6.64) 
had to be replaced by 

i/>(q,t) = J c(E,t) dE xjj E {q) (6.65) 

and \c\(E,t) \ 2 dE was interpreted as the transition probability to a state 
whose energy lies between E and E + dE. Dirac now reformulated in 
terms of his transformation matrices the preceding perturbation treat¬ 
ment as well as Born’s treatment of the scattering of electrons by atoms 
and found that full agreement is obtained if it is assumed that “the coeffi¬ 
cients that enable one to transform from the one set of matrices to the 
other are just those that determine the transition probabilities.’’ 86 

This statistical interpretation ascribed a physical meaning to every 
transformation from one matrix scheme to another—and not only to the 
transformation from the (q) scheme, in which position matrices are diag¬ 
onal, to the ( E ) scheme, in which Hamiltonian or energy matrices are 
diagonal. Dirac’s association of transformation coefficients with probabil¬ 
ities was obtained at about the same time, though formulated only for the 
discrete case, by Heisenberg 87 on the basis of considerations concerning 
energy fluctuations. Also Pauli, it seems, independently recognized at 
that time the possibility of generalizing Born’s interpretation. In an im¬ 
portant footnote in his paper 88 on Fermi’s statistics 89 Pauli formulated 
Born’s interpretation of the Schrodinger function as follows: the probabil¬ 
ity of finding the position coordinates q l9 q 2 , ..., q f of a system of N 
particles in the volume element dq x dq 2 * • * dq f of the configuration space 
is given by \i/>(q x ,q 2 9 ...,#/) | 2 dq x dq 2 * * * dq f provided the system is in 
the state characterized by xjj. In a letter 90 to Heisenberg, Pauli wrote that 
“Born’s interpretion may be viewed as a special case of a more general 
interpretation; thus, for example, \x//(p)\ 2 dp may be interpreted as the 
probability that the particle has a momentum between p and p -\- dp.” In 
general, Pauli argued, there always exists, for every two quantum-me¬ 
chanical quantities q and a function qpiqfi) which he called “probabili- 
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ty amplitude” (“Wahrscheinlichkeitsamplitude”) and which has the 
property that | cp ( q^So ) 1 2 dq is the probability that q has a value between q 0 
and q 0 + dq 0 if (3 has the fixed value /? () . 

Pauli’s remarks furnished the basis on which Jordan 91 proposed at the 
end of 1926, independently of the work of London and of Dirac, his for¬ 
mulation of the transformation theory. Although ultimately in full agree¬ 
ment with these earlier formulations, as Jordan subsequently recognized, 
it adopted a totally different point of view. In its formal aspect it was 
essentially an axiomatic approach and was based on Pauli’s probability 
amplitude which was postulated to satisfy three major conditions: ( 1 ) 
q){q,/3) is independent of the mechanical nature (Hamiltonian function) 
of the system and depends only on the kinematic relation between q and /?; 
(2) the probability (density) that for a fixed value /3 0 of/ 3 the quantum- 
mechanical quantity q has the value q 0 is the same as the probability 
(density) that, for a fixed q 0 of q , /.3 has the value/? () ; (3) probabilities are 
combined by superposition, that is, if <p(x,y) is the probability amplitude 
for the value x of q at the fixed valuer of / 3 , and ;jf(x:j>) is the probability 
amplitude for the value x of Q at the fixed value of q, then the probability 
amplitude for x of Q at the fixed value y of /3 is given by 

In the particular case Q = /?, Jordan’s <t>(x:,j>) becomes Dirac’s 8(x — y). 

According to Jordan p is defined as the momentum canonically conju¬ 
gate to q if the probability amplitude p (x,y) , for every possible value x of/? 
at fixed y of q , is given by 


p ( x,y ) = exp 


xy 

ifi 


( 6 . 66 ) 


Hence, Jordan inferred, for a fixed value of q all possible values of p are 
equally probable . 92 As he pointed out, p ( x f y) satisfies the differential 
equations 



(x,y) = 0 
p (x,y) = 0 . 


(6.67) 

( 6 . 68 ) 


Let cp(x,y), he continued, be the probability amplitude that Q has the 
value x at y of q and (x,y) the amplitude that Q has the value jc at p=y; 
then 


<p(x,y) = 0 ( x,z)p ( z,y)dz . 


(6.69) 
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Introducing the linear operator 

7"=^ dxQ{y,x) • • • (6.70) 

Jordan could write Eq. (6.69) in the form 

<; p(x,y) = Tp ( x,y ). (6.71) 

It will have been recognized that Jordan’s Eqs. (6.67) and (6.68) are 
essentially the Schrodinger equation for a free particle, p ( x,y ) being the 
(conjugate) Schrodinger wave function for a particle of constant momen¬ 
tum. By a skillful application of the operator technique based on the intro¬ 
duction of Hermitian conjugate operators, Hermitian operators, and simi¬ 
lar notions which subsequently gained great importance for the modern 
formulation of quantum mechanics, Jordan proved that his formal theory 
contains not only Schrodinger’s wave mechanics and Heisenberg’s matrix 
mechanics, but also the Born-Wiener operator calculus and Dirac’s q- 
number algorithm as special cases. 

From the metascientific or methodological point of view Jordan’s 
transformation theory was an important achievement: it comprised all 
preceding quantum-mechanical formalisms in one unified formulation. It 
satisfied a deep-rooted and perennial desire of the human intellect. For the 
quest for a conceptual synthesis which, as we have seen, lay at the founda¬ 
tion of Moore’s “general heuristic principle of scientific procedure” 93 is 
much older than modern physics or mathematics. In fact, whenever for¬ 
mally diverging theories explained the same phenomena, science had 
searched for such a unification. Thus, in ancient astronomy, as Theon of 
Smyrna 94 reported, “Hipparchus deemed it worthy of investigation by 
mathematicians why on two hypotheses so different from one another, 
that of eccentric circles and that of concentric circles with epicycles, the 
same results appear to follow.” The analogous problem in the physics of 
the atom, the question why on four hypotheses so different from one 
another, those proposed by Heisenberg, Born-Wiener, Schrodinger, and 
Dirac, “the same results appear to follow,” found its answer in Jordan’s 
transformation theory. The only unsatisfactory aspect of Jordan’s work 
was the intricacy of its mathematical exposition, which confined its full 
appreciation to only a few experts at that time. Kennard’s lucidly written 
resume 95 of contemporary quantum mechanics, based primarily on Jor¬ 
dan’s approach, contributed to a wider dissemination of Jordan’s ideas. A 
full satisfactory and mathematically perspicuous presentation of the 
transformation theory and its unifying character was made possible, how¬ 
ever, only after functional analysis became the language of quantum me¬ 
chanics. 



Statistical Transformation Theory 


323 


6.3 The Statistical Transformation Theory in Hilbert Space 

The transformation theory, both in Dirac’s continuous matrix formula¬ 
tion and in Jordan’s semiaxiomatic formalism, with its emphasis on the 
notion of probability amplitudes, was the first indication that it might be 
possible to dispense with the correspondence principle in the construction 
of the conceptual edifice of quantum mechanics. For it made it gradually 
clear that statistical considerations concerning the measuring process 
might serve instead as the foundation. 

For a full appreciation of this important state in the development of our 
theory it is worthwhile to review the principal achievements of the trans¬ 
formation theory against the background of the earlier formulations of 
quantum mechanics. In the latter the energy concept played a dominant 
role. Schrodinger’s eigenvalue problem was that of energy eigenvalues. In 
Heisenberg’s matrix mechanics only those q's and p's were solutions of the 
problem of motion for which the energy matrix was diagonal. Nondia¬ 
gonal elements in the solution matrices were associated with the quantum 
transitions and the diagonal elements with the time averages of the repre¬ 
sented magnitude in the stationary states, in accordance with the fact that 
they correspond with the constant terms in the respective Fourier expan¬ 
sions. As observable were regarded primarily the energy values and the 
squares of the nondiagonal elements in matrices which—as the electric- 
dipole matrix—were connected with the emission, absorption, or disper¬ 
sion of light. This particular choice of what was to be regarded as observ¬ 
able was, of course, conditioned by the historical development of the 
theory which originated in the study of periodic motions. In fact, the 
treatment of the free electron, for example, was beyond the conceptual 
apparatus of this formulation of quantum mechanics. The notion of posi¬ 
tion had no place in a calculus of matrix elements which were disguised 
Fourier coefficients. 

The transformation theory introduced the experimentally required 
generalizations by postulating that, in principle, any Hermitian matrix A 
represents an observable quantity a , on a par with energy, and that the 
eigenvalues of A are the possible results of measuring a. Through Dirac’s 
introduction of continuous matrices and Born’s probabilistic interpreta¬ 
tion of Schrodinger’s wave function, the notion of position was retrieved. 
If previously the energy states of an atomic system formed, so to speak, the 
reference system for theoretical predictions, a more general point of view 
was now adopted. For, confining our discussion to the discrete nondegen¬ 
erate case, we may formulate the basic problem of the transformation 
theory, as envisaged by Dirac and Jordan, as follows: If the measurement 
of a quantity a resulted in a m , what is the probability w mk that a subse¬ 
quent measurement of another quantity b will result in b k l This was the 
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answer: Let the matrix A {,) = ( a mn ) = ( S mn a n ) represent a in the a 
scheme or coordinate system 2 (1) , in which it is diagonal, its diagonal 
elements a mn = a m being the possible results of measuring a; let B (2) 
= (b rs ) = ( 8 rs b s ) represent b in the b scheme or coordinate system 2 {2) , 
in which it is diagonal, its diagonal elements b rr = b r being the possible 
results of measuring b\ finally, let U be the unitary matrix which trans¬ 
forms 2 {,) to 2 {2) so that A (2) = U~ X A {l) U= U ~ k x a k U k ^ and 

B (1) = UB {2) U~ l = U ml bf U lm ~ Then, and this is the answer, 


Wmk 



(6.72) 


It was a plausible answer. First of all, since U is unitary, 



(6.73) 


a condition which such a probability has to satisfy. In addition, if in partic¬ 
ular a happens to be the energy, (B {1) ) nn = ^ b l \ U nl | 2 . Hence, the diag- 

/ 

onal term (B (,) ) nn of B (,) in the energy scheme, which corresponds to the 
time-independent term in the Fourier expansion of b and signifies there¬ 
fore the time average of b in the nth energy or stationary state, turns out to 
be the statistical mean or expectation value. 

In the following discussion of the further development of the transfor¬ 
mation theory we shall confine ourselves as far as possible to the formal 
aspects of this development and postpone questions of interpretation and 
epistemological implications as far as possible to a later stage. It may be 
argued that greater consistency would have been gained had we deferred 
also Born’s probabilistic interpretation to this later stage. Since, however, 
the entire development of the transformation theory rested on Born’s 
assumption, its deferment would have made our account rather incom¬ 
prehensible. On the other hand, we believe it to be possible to postpone the 
principles of uncertainty and complementarity and related issues—al¬ 
though they preceded chronologically a great part of the subject matter of 
the following discussion—without seriously impairing the logical coher¬ 
ence of our exposition. 

It was most fortunate for the development of our theory that Heisen¬ 
berg’s appeal 96 to pure mathematicians was answered, at the right mo¬ 
ment, in Gottingen by Hilbert himself. Since he had been consulted on 
numerous mathematical aspects of the theory ever since its inception, 
quantum mechanics was no terra incognita to him. In the fall of 1926 he 
began a systematic study of its mathematical foundations. His assistants 
were Lothar Wolfgang Nordheim, a former pupil of Born, and the twenty- 
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three-year-old John (Johann) von Neumann, who had just arrived in 
Gottingen after having obtained a Ph.D. in mathematics from the Univer¬ 
sity of Budapest and a degree in chemical engineering in Zurich. During 
the winter semester 1926/27 Hilbert also gave a two-hour lecture every 
Monday and Thursday morning on the mathematical foundations of 
quantum mechanics (“Mathematische Methoden der Quantentheorie”), 
a resume of which, edited by Nordheim, was published in the spring of 
1927. 97 

Hilbert’s approach was, of course, that of a pure mathematician. In his 
quest for logical rigor he elaborated Jordan’s semiaxiomatic theory, based 
as it was on the notion of probability amplitudes. He first investigated the 
physical requirements which these amplitudes have to satisfy so as to lead 
to results in agreement with experience, and then searched for an analyti¬ 
cal apparatus in terms of which these requirements could be formulated 
and satisfied. By a sufficiently stringent formulation of the physical stipu¬ 
lations he attempted to reach a unique determination of the analytical 
apparatus. What he had in mind was a conceptual system like that of his 
axiomatization of geometry, where the axioms define the relations be¬ 
tween its primitive notions such as ‘“point,” “straight line,” “plane” in an 
unequivocal manner and where linear algebra provides an analytical ap¬ 
paratus which satisfies these relations; moreover, the analytical appara¬ 
tus, conversely, may become the source for new geometrical theorems. In 
analogy to this procedure, which had proved itself so valuable, Hilbert 
postulated six relations which amplitudes are supposed to fulfill. In order 
to establish the analytical apparatus Hilbert associated with every dynam¬ 
ical variable an operator and regarded the corresponding operator calcu¬ 
lus, soon to be outlined, as the appropriate analytical apparatus. However, 
concluding his prefatory remarks, Hilbert had to concede that in physics, 
contrary to mathematics, the analytical apparatus has often to be sur¬ 
mised before a complete axiomatization of the physical requirements is 
accomplished and that usually only the interpretation of the formalism 
leads to an exhaustive postulation of the physical stipulations. “It is diffi¬ 
cult to understand such a theory,” said Hilbert, “if the formalism and its 
physical interpretation are not strictly kept apart. Such a separation shall 
be adhered to even though at the present stage of the development of the 
theory no complete axiomatization has as yet been achieved. However, 
what is definite by now is the analytical apparatus which will not admit 
any alterations in its purely mathematical aspects. What can, and prob¬ 
ably will, be modified is its physical interpretation for it allows a certain 
freedom of choice.” 98 

Although Hilbert’s contention concerning the finality of the analytical 
apparatus, as we shall see, was soon to be disproved, his conception of a 
separate analytical apparatus through the interpretation of which empiri- 
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cal relations can be brought into the form of mathematical statements 
became a basic element of the methodology of modern theoretical physics. 
Having for many years been convinced of the importance of integral equa¬ 
tions also for theoretical physics," Hilbert assumed that the operators to 
be associated with dynamical variables could be written as integral opera¬ 
tors of the type 



(/O')) = J <pix,y) f {y)dy 


where the kernel of the operator cp T = cp(x,y), provides the probability 
amplitude. Thus, if T is the operator which transforms canonically the 
operator q for the position coordinate into the operator F of the variable 
F (/?,#), a function of q and the momentum p,<p T is the probability ampli¬ 
tude <p(xy;qF) relating to q and F, that is, the probability amplitude that 
for a fixed value y of F(p,q) the coordinate q has a value between x and 
x + dx. If, in particular, F ( p,q) is q itself, or p , the corresponding kernels 
are x8(x —y) and (fi/i)8'(x — y), respectively. Generalizing these re¬ 
sults for any two dynamical variables F x and F 2 , Hilbert and his collabora¬ 
tors studied the conditions that the relative probability density 


w(xy;F x F 2 ) = <p(xy;F ] F 2 )<p*(xy;F l F 2 ) 

is real and nonnegative. They concluded that only Hermitian operators 
are to be admitted. With the help of Dirac’s 5-function they derived differ¬ 
ential equations from the integral-operator equations and showed, in par¬ 
ticular, that the time-independent Schrodinger wave equation 



(p(xW;qH) = 0 


is the functional equation for the probability amplitude for energy and 
position coordinate, and that the time-dependent wave equation 


H 


V i dx) i dt 


<p(xt;qT) = 0 


is obtained if in <p(xy;F x F 2 ) F x is the position operator and F 2 a “time 
operator,” associated with the time t = t{p,q) and conceived as a dynami¬ 
cal variable canonically conjugate to the Hamiltonian H(q,p). 
q){xt\qT) = ijj(x,t) is consequently the probability amplitude that for a 
given time t the position coordinate has a value x between x and x + dx. 
Hilbert, von Neumann, and Nordheim pointed out that their result is in 
full agreement with Born’s probabilistic interpretation. For, according to 
Born, in the general solution of the time-dependent Schrodinger wave 


equation xf/(x,t) = ^ c n i/j n (x) exp[ (i/fi) W n t ], \c„ exp[ (i/ft) W„t ] | 2 
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measures the probability that the atom is found in the nth state and 
| ( x ) | 2 that it is found, while in state n, at x. The probability interference 
principle consequently shows that ifs(x,t) is the amplitude that the atom, 
irrespective of its state, is found at x. 

The Hilbert-Neumann-Nordheim transformation theory, like its pre¬ 
decessors, the theories of Dirac and Jordan, offered a unified formalism in 
which both matrix and wave mechanics were embedded as special cases. 
This unification, as von Neumann soon realized, depended critically on 
the properties of the Dirac 5-functions and was therefore objectionable to 
at least the same extent as were these functions. 

In matrix mechanics, it will be recalled, the basic problem, as Born and 
Jordan already knew, 100 was the solution of the matrix equation 

U~'H(pq)U= W (6.74) 

or 

UW=HU (6.75) 

where the diagonal terms of the diagonal matrix W are the energy eigen¬ 
values of the dynamical system under discussion and where U denotes a 
unitary transformation which diagonalizes H. Using the notation 
H= ( h mn ), U- ( u mn ) - (u^), and W= ( w m S m „ ), one could write 
Eq. (6.75) as follows: 


2 hmkUi*'= AuZ' A = w„ (6.76) 

k 

and regard the elements u k n) of the nth column of U as the components of a 
vector (u k ). The basic problem could consequently be reformulated as 
that of solving the eigenvalue problem 

2 ^(6-77) 

k ' 

for given h mk >. If it is recalled that in wave mechanics the basic problem 
was that of solving the eigenvalue problem 

Hxfj(q) = Ax/j(q) (6.78) 

where H denotes the Hamiltonian operator of the system, the similarity 
between the two equations (6.77) and (6.78) cannot be ignored. It sug¬ 
gests that u m be regarded as a function of the “discrete variable” m just as 
xfj(q) is a function of the continuous variable q. Then the right-hand sides 
of the two equations (6.77) and (6.78) have the same structure. 

By the same token, h mk > should correspond to a function of two vari¬ 
ables, h(q,q '), and the ^ to an integral J • • • dq . In other words, Eq. 

k' 
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(6.77) should be written as 

J Hq,q')x(i{q')dq' =Aif>(q). (6.79) 

Comparison of Eq. (6.79) with Eq. (6.78) yields 

= f h(q,q')ift(q')dq' (6.80) 

which shows that h{q,q ') is an integral kernel of the functional or integral 
operator H of precisely the type used by Hilbert and his collaborators. 
Matrix and wave mechanics can therefore be unified if it proves possible to 
find for every acceptable H a corresponding integral kernel h(q,q ') which 
satisfies Eq. (6.80). Taking for H the identity operator, so that 
Hxft(q) = ip(q), one can easily confirm that 

= \ h(q,q')tp(q')dq' (6.81) 

can be satisfied only if the integral kernel is Dirac’s 6 -function 8(q — q'). 

Von Neumann, now, contended quite generally that a unification of 
matrix theory with wave mechanics by setting up a correspondence be¬ 
tween the “space” Z of the discrete index m in Eq. (6.77) and the space fl 
of the continuous variable q in Eq. (6.78) can be carried out only by 
transforming a differential operator into an integral operator, a process 
which necessitates the introduction of “improper” functions such as the 
Dirac 5-function. 

It should now be recalled that in 1927 the 5-function, in spite of its 
increasing popularity, was mathematically an illegitimate conception. It 
is so still today if it is conceived as an ordinary function. The justification 
given by KirchhofF 101 was faulty simply because there is no limiting func¬ 
tion of ((i/Jit) exp( — fi 2 g 2 ) for/z-» oo. Nor could it be defined as the 
derivative of the Heaviside function Y(x ), for which Y(x) = 0 if x<0, 
T(x) = i if x = 0 , and T(jc) = 1 if x> 0 , because at x = 0 a derivative 
does not exist. In fact, it was not difficult to see that the requirements 
5(x) = 0 for x t^O and 00 8(x)dx = 1 are incompatible irrespective of 
whether the Riemann or the Lebesgue conception of the integral is being 
used. That the difficulty could be overcome by expressing 6 (x) as a 
Stieltjes 102 integral or as a distribution became clear only after 1945, when 
Laurent Schwartz 103 generalized the notion of function, derivative, and 
Fourier transformation by his theory of distributions. Utilizing results 
from the theory of linear topological spaces, Schwartz was able to replace 
the ill-defined 5-function and its derivatives by well-defined linear func¬ 
tionals or distributions which have the immediate advantage of always 
possessing a derivative which is itself a distribution. 
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A rigorous definition of the 5-function could also be obtained by a 
method introduced by Jan G. Mikusinski, 104 who defined generalized 
functions as the closure of certain ordinary functional spaces with respect 
to a weak topology. Roughly speaking, generalized functions were defined 
as certain sequences of ordinary functions, just as real numbers were de¬ 
fined by Cantor as sequences of rational numbers. 105 In contrast to these 
methods, which also include Marcel Riesz’s theory of pseudofunctions, 106 
a different approach has recently been proposed toward a rigorous foun¬ 
dation of the theory of 5-functions. By extending, not the concept of func¬ 
tion but that of number—again in a way similar to Cantor’s closure pro¬ 
cess by embedding real numbers in sequences of such—Schmieden and 
Laugwitz 107 defined infinitely large and infinitely small numbers and 
showed that on the basis of what they called “continuous continuations” 
(“stetige Fortsetzungen”) of real function and “normal functions” 
(“normale Funktionen”), functions can be constructed in a rigorous 
manner which possess all the properties of Dirac’s 5-function. To go into 
details would lead us too far from our subject. We have gone into this 
digression to show that the notion of 5-function, as it has been applied by 
Dirac, Jordan, and Hilbert and his collaborators, can be fully legitimized 
today. But it could not in 1927. 

Arguing that the use of the 5-function leads to “insolvable mathemat¬ 
ical difficulties” (“unlosbare mathematische Schwierigkeiten”), 108 von 
Neumann rejected the proposed fusion of the space Z of the discrete vari¬ 
able m in Eq. (6.77) with the space ft of the continuous variable q in Eq. 
(6.78). Instead, elaborating on certain ideas of Hilbert’s work on linear 
equations, he developed between 1927 and 1929 a new mathematical 
framework 109 of the theory which subsequently proved to be the most 
suitable formalism of nonrelativistic quantum mechanics as we use it to¬ 
day, as well as its extensions, the relativistic quantum mechanics of parti¬ 
cles and the quantum theory of fields. 

The decisive innovation on which von Neumann based his new ap¬ 
proach was his ingenious discovery, early in 1927, that a new formalism of 
quantum mechanics can be established in view of the fact that not Z and 
ft, but rather the sequence space F z over Z and the function space F n over 
ft are essentially identical. Here F z is the set of all sequences { u m 

satisfying | u m \ 2 = 1 and F n the set of all summable and square-integra- 

ble complex-valued functions ifr(q) over ft, satisfying S\^(q)\ 2 dq = 1. 
Von Neumann assumed that the indicated normalizations, which, inci¬ 
dentally, exclude trivial solutions, can always be imposed in the treatment 
of eigenvalue problems of the type envisaged. The sequence space F z , it 
will be recalled, played an important role in Hilbert’s theory of integral 
equations and his related investigations on bounded quadratic forms of 



330 


Conceptual Development of Quantum Mechanics 


infinitely many variables. It was therefore known as the “Hilbert sequence 
space.” Von Neumann’s F n , on the other hand, corresponded to what 
became known as L 2 (H) in functional analysis, that is, the space of all 
summable and square-integrable functions (in the sense of Lebesgue) 
over the domain SI. Von Neumann knew that F z and F n are isomorphic 
and isometric, provided the operations of addition, multiplication by a 
scalar, and the formation of an inner product are appropriately defined. 
For he recalled that twenty years ago Frederic (Friedrich) Riesz, 110 like 
von Neumann a native of Hungary, proved an important theorem which 
in a slightly modernized version can be formulated as follows: if 
{ <p n (x )}f° is a complete orthonormal system in / 2 ((1) = L 2 (a 9 b) and 

{a n } f° a sequence of (real) constants, then the convergence of ^a 2 n is a 

necessary and sufficient condition that there exists in L 2 {a,b) an almost 
everywhere (i.e., up to a range of measure zero) uniquely determined 
function/(x) such that a n = ( x)<p n (x)dx for all n. 

Riesz’s paper was presented to the Gottingen Society of Hilbert on 
March 9, 1907. Four days earlier, at a meeting of the Mathematical So¬ 
ciety of Briinn, Ernst Fischer had proved the following theorem: if a se¬ 
quence of functions all of which belong to L 2 (a 9 b) 9 converges in 

the mean [that is, lim S b {f m —f n ) 2 dx = 0], then there exists in 

m,n = oo 

L 2 {a 9 b) a function/(x) to which they converge in the mean [that is, 
lim S b a (f—f n ) 2 dx = 0]. In a paper 111 presented to the Paris Academie 

n = oo 

des Sciences on April 29,1907, in which Fisher published the proof of this 
theorem, he pointed out that Riesz’s conclusion is an immediate conse¬ 
quence of this theorem; for, he said, using Riesz’s notation, the conver¬ 
gence of a 2 n entails the convergence in the mean of ^a n <p n ; hence there 

exists a function/(x) to which ^ a n <p n ; converges in the mean; but this 
implies that a n = S b a f \f> n dx. 

The “Fischer-Riesz theorem,” as Riesz’s theorem was subsequently 
called, established therefore a one-to-one correspondence between bound¬ 
ed sequences or vectors { u m } and square-integrable functions tp(q). As 
von Neumann realized, this correspondence is linear and isometric, that 
is, if {u m }++ifr(q) and {v m }++<p(q)> then {au m }++aif>(q), 

{u m +v m }++ifr(q) +<p(q), and £ \u m | 2 = S\$(q)\ 2 dq- In view of these 

results von Neumann reasoned as follows: since F z and F n —and not Z 
and fl—are the “real analytical substrata” of matrix and wave mechanics 
and since they are isomorphic, that is, ultimately only different math- 
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ematical representations of the same abstract relations, the equivalence 
between matrix and wave mechanics is a logical consequence of this iso¬ 
morphism. Moreover, he added, a formulation of quantum mechanics 
which retains only absolutely essential relations and dispenses with all 
accidental trappings has to be based on the abstract common structure of 
F z and F n or, as he called it, the “abstract Hilbert space.” In von Neu¬ 
mann’s formulation of quantum mechanics was found the ultimate fulfill¬ 
ment—as far as the theory of quanta is concerned—of Hipparchus’s insis¬ 
tence on the usefulness of investigating “why on two hypotheses so 
different from one another... the same results appear to follow,” 112 
which, as we have seen, had already characterized Jordan’s line of re¬ 
search. 

To this end von Neumann developed an axiomatic theory of the ab¬ 
stract Hilbert space which, defined as an infinite-dimensional complete 
separable linear vector space with a positive definite metric, contained F z 
and F n as special realizations. After a study of the geometrical properties 
of this space von Neumann constructed a theory of linear operators whose 
domains are everywhere dense, abandoning thereby the requirement that 
operators must be defined everywhere. In the realization F a von Neu¬ 
mann’s operators are functionals as studied in functional analysis. Simi¬ 
larly, von Neumann’s defining equation of adjoint operators A and A f , 
namely, 113 

(A'Jg) = (Mg) 

where the parentheses denote the inner product of two vectors in the Hil¬ 
bert space, can be interpreted in the realization F n as the equation 

f (A V )g*dr = f f(Ag) *dr 
Jn Jn 

A is Hermitian if A =A f and A is continuous or bounded if for a constant C 
and for all vectors / for which it is defined 

\\Af\\<C\\f\\ 

where the double bars denote the length (fif) 1/2 of /. A is said to be 
definite if for every fi{Afif) >0; A is unitary if AA f = A f A = /, where I 
is the unity operator satisfying If=fi 

Most of these conceptions were not new. Some of them has been used 
by Frechet 114 about twenty years earlier when he introduced the language 
of Euclidean geometry into the theory of functional spaces; in particular, 
he gave the earliest definition of such notions as separability and com¬ 
pleteness of spaces. Other concepts, such as the length || f\\ or the concept 
of orthogonality, can be traced back to Erhard Schmidt, 115 to Study, 116 
and even to Grassmann. 117 Schmidt was the first to recognize the impor¬ 
tance of projection operators which, as we know, proved so important for 
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von Neumann’s spectral theory. For finite-dimensional spaces such pro¬ 
jections had already been introduced by Grassmann, who, in such spaces, 
had defined also inner products, linear independence, complete orthonor¬ 
mal systems, and linear manifolds. For infinite-dimensional spaces the 
notion of complete orthonormal systems seems to have been used for the 
first time by Gram. 118 The inequality | ( fg) | < || f\\ ||g||, which appears so 
frequently in von Neumann’s theory, had been called, by Poincare 119 in 
1896, “l’inegalite de Schwarz” after its discoverer. 120 Its usefulness be¬ 
came evident after Hellinger and Toeplitz 121 had shown in their study of 
infinite matrices in how powerful a way it can be applied. Finally, the 
notion of Hermitian operators goes back, of course, to that of Hermitian 
forms which had been introducted by Charles Hermite, first 122 for rt = 2, 
and one year later 123 for any finite rt. 

After these historical remarks on some of the most important func¬ 
tional-analytical notions in modem quantum mechanics we shall turn to 
von Neumann’s treatment of the eigenvalue problem. Von Neumann real¬ 
ized that the traditional formulation of this problem Hxp = Aifr and the 
requirement that its solutions form a complete set, a development which 
can be traced back to Sturm 124 and Liouville, 125 cannot be carried over 
into the transformation theory in Hilbert space—the eigenfunction of a 
free particle is no Hilbert vector; he therefore reformulated the eigenvalue 
problem by using a method developed by Hilbert in 1906 in his study of 
integral equations. 126 

To understand von Neumann’s reformulation we shall discuss it first 
for linear vector spaces of N dimensions. Let us assume that the unitary 

N 

matrix U diagonalizes the Hermitian form ( Ax,x) = ^ A mn x*x n to 

n 

N M ( Sr 

X a ky k yt= X ( fl(T) 2 y ( P T) y ( P T) * 

k = 1 r = 1 V p = 1 

where a (T) , with a (l) < a {2) < • • • < a iM) , are the < N) different eigen¬ 
values of A and where a (r) , of g r -fold degeneracy, belongs to the g T vec¬ 
tors y {T) with components y iT) . Using the projection operator F (r) , an TV- 
dimensional matrix whose elements are zero except those corresponding 
to a (T) —which are unity—so that 


Sr 

2 

p = i 





y>y) 


we easily see that 


(Ax,x) = ^a iT \F < - T) y,y) 
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and hence, since y=U l x, 

(A,x) =^a ( - T \UF ( - T ' , U~'x,x). 

T 

Since also UF (r) U ~ 1 is a projector 

( Ax f x) = ^a iT) (E iT) x f x) 

T 

or symbolically 


A = '£a (r) E (T) . 

T 

If, in particular, A is the identity operator /, 

I='£E lT) 

T 

For this reason the range of projectors {E (r) } has been called a “resolu¬ 
tion of the identity.” 127 The result of our discussion for finite-dimensional 
vector spaces can consequently be formulated as follows: to every Hermi- 
tian operator ,4 belongs a (unique) resolution of the identity {E (T) } such 

that ,4 = ^ a (T) E ir) ; that is, ( Ax,x) = ^ a iT) (E (T) x,x) or, more gener- 

r r 

ally, ( Ax,y) = ^a (T) (E {T) x,y) and the eigenvalue problem has been re- 

T 

duced to the determination of the resolution of the identity. 

Von Neumann, now, showed that this approach can be generalized in a 
mathematically rigorous manner to the infinite-dimensional Hilbert 
space. As a result of this generalization, for whose details the reader is 
referred to the original literature, von Neumann could formulate the 
eigenvalue problem as follows: For a given Hermitian operator ,4 a resolu¬ 
tion of the identity {is (A)} has to be found such that (1) 

A = S °1 ^AdEiA), that is, such that (A fg) = SZ oo Ad{E(A) fg) holds 
forallgandall / for which 0<jA 2 d{ ||is(A)/|| ) 2 < + oo; (2) for ', 
E(A)^E(A , ) f that is, E(A’) — E(A) is a projection operator; (3) for 
A-+ -f cdo, E(A for — oo, E(A)f^> 0; (4) A’^A with 
A '>A,E(A ') f^E(A )/; that is, discontinuities occur only to the left. 

For bounded (or, equivalently, continuous) operators the problem 
had been solved by Hilbert, who had shown that to each such operator 
there belongs one and only one resolution of the identity. Since the impor¬ 
tant operators in quantum mechanics, however, are in general unbound¬ 
ed, von Neumann was faced with the problem of extending Hilbert’s spec¬ 
tral theory to the case of unbounded Hermitian operators, a problem 
which he finally solved 128 in 1929 for all Hermitian operators which are 
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maximal, that is, which possess no proper extensions. 129 For he showed 
that (1) every Hermitian operator can be extended to a maximal operator 
and (2) to every maximal operator belongs exactly one resolution of the 
identity or none. 

The new formalism also made it possible for von Neumann to reformu¬ 
late the statistical foundations of quantum mechanics. Referring to Schro- 
dinger’s eigenvalue problem Hxft n = A, n ifr n9 which, for the sake of simpli¬ 
city, he assumed to possess a pure point spectrum, he noticed that the 


A 


resolution of the identity for 77is given by E{ A) f— ^ ) x Pn andfor 

the operator q by 

xfj{q) if q<A 


F(A)xfr(q) = 


0 if q > A 


According to Born Sj \ ip ( q) \ 2 dq is the probability that the position coordi¬ 
nate q of the system in state xp is found in the interval / = [q\q"] and 


^ \S$(q)$*(q)dq\ 2 is the probability that the energy of the system in 

A n el 

state xp is found in the interval I = [A \A " ]. Von Neumann now gave to 
these two probability statements a unified formulation as follows. If it is 
only known that the energy is in /, the probability P that q is in /is given by 

p = X f ^ \F(q',q")ip„(q)\ 2 dq 

A„el Jq' X„€l 

= £ f~ \F(q',q" )W 'A ( q)\ 2 dq 

n = l J — oo 


= X \\F{q',q")EU',A")tl, n {q)\\ 

n = 1 

or symbolically 130 

P= [E(I)F( / ) ]. 

Generalizing this result, von Neumann formulated the basic statistical 
assumption of quantum mechanics as follows: if R lf R 2 ,..., R, form one 
set of (commutative) operators representing dynamical variables and S lf 
S 2 ,. .., Sj another such set, the former having E x (A ),... as their resolu¬ 
tions of the identity and the latter F x ( A ),..., then the probability that the 
values of the S Xf ... lie in the intervals /,,... if the values of the 
R Xf ... lie in the intervals I x ,... is given by 

[E(I X ) • • •^(/,)F(/ 1 ) • • -F(Jj)]. 

This statement, von Neumann concluded, contains all the statistical as¬ 
sertions of quantum mechanics which have been made so far. On the basis 
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of these results he offered a mathematically rigorous formulation of the 
transformation theory which comprised all previous formalisms of quan¬ 
tum mechanics. It will have been observed that this formulation, which 
von Neumann presented in Gottingen on May 20, 1927, did not postulate 
any physical or epistemological assumptions other than those on which 
the transformation theories of Dirac and Jordan were based. In particular, 
the uncertainty principle which Heisenberg had meanwhile (in March, 
1927) proposed did not yet play any part in these considerations. As we 
shall see, however, this principle not only greatly influenced von Neu¬ 
mann’s subsequent elaborations and, in particular, his theory of measure¬ 
ment, but also was of decisive importance for the—at that time generally 
accepted—interpretation of the whole formalism advanced so far; we 
shall therefore have to discuss the uncertainty relations and their episte¬ 
mological implications in greater detail. 

But before we conclude the present chapter, we should add a note of 
caution against an overoptimistic evaluation of von Neumann’s great 
achievement. For there are some serious doubts whether a formulation of 
quantum mechanics, based on a separable Hilbert space in which, conse¬ 
quently, every orthonormal basis is denumerable, suffices to encompass 
the solutions of all physically significant problems. In particular, the ap¬ 
plication of von Neumann’s theory to systems with continuous energy 
states has led to the systematic use of functions not normalizable in the 
sense of quadratic integrability. Although alternative methods of norma¬ 
lization have been developed (Weyl, Fues, and others 130a ), no canonical 
set of the resulting wave packets could be established to serve as an ortho¬ 
normal basis in a space of quadratically integrable functions. To overcome 
such difficulties, inner-product function spaces of solutions of the Schro- 
dinger equation for free and scattered particles, admitting plane, cylindri¬ 
cal, and spherical waves, have been constructed; 131 but—in contrast to 
von Neumann’s approach and its foundation for the statistical interpreta¬ 
tion—in these spaces an extension theorem (the possibility of expressing 
the basic functions of one space in terms of the basic functions of any 
other) does not generally hold. Furthermore, a mathematically rigorous 
treatment of the scattering problem in terms of the so-called scattering 
operator or the S matrix could so far not be worked out in a satisfactory 
manner within the context of von Neumann’s formalism in spite of the 
importance of the problem, although during the past ten years some 
promising results along this line have been achieved. 132 Finally, it is well 
known that for systems described by a quantized field theory von Neu¬ 
mann’s separability condition is not satisfied. 
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CHAPTER 7 

The Copenhagen Interpretation 


7.1 The Uncertainty Relations 

With the establishment of the statistical transformation theory the for¬ 
malism of nonrelativistic quantum mechanics was completed in all its 
essential points. But a formalism, even if complete and logically consis¬ 
tent, is not yet a physical theory. To reach this status, some of its symbols 
have to be given an operationally meaningful interpretation (or epistemic 
correlation in the sense of Carnap’s “phenomenal-physikalische Zuord- 
nung” 1 ). For unless a formalism is linked with certain data of sensory 
experience in such a way that both the beginning and the end of a chain of 
theoretical deductions are anchored in experience, it is not verifiable or 
falsifiable by experiment or observation and consequently not a physical 
theory. 

Apart from Bom’s probabilistic interpretation of the wave function— 
the idea of associating expectation values with average results of measure¬ 
ments, though already implicitly referred to by Bom, as we have seen, was 
not yet fully worked out—no epistemic correlations had so far been estab¬ 
lished between the quantum-mechanical formalism and experience. Even 
if Bohr’s frequency relation was considered as part of the formalism, or 
rather of its logical conclusions, and spectroscopic confirmation as meth¬ 
odologically unobjectionable, the formalism could not yet be regarded as 
fully interpreted. For even if certain conclusions of formal deductions 
could be compared with experience, the early stages of these deductions 
did not have any clear-cut correlations or associations with sense data. 
The formalism was, so to speak, interpreted, at best, only at its end. This 
unusual state of affairs was due, of course, to the peculiar development of 
the formalism which at first was dominated by the correspondence princi¬ 
ple but later detached from it. 

The situation, however, was still worse. Terms like “place,” “velocity,” 
“orbit” continued to play an important part in the representation of the 
formalism although their very rejection lay at the foundation of the whole 
development. In any case, it was clear—as the basic commutation relation 
pq — qp = h /2m incontestably evinced—that these concepts, if at all le¬ 
gitimate, could not simply have their classical significance. Hence, even 
Bom’s interpretation, based as it was on the conception of the localizabi- 
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lity of a particle or its “place,” was, strictly speaking, only an ignotum per 
ignotius. In short, there still remained the open questions: how to interpret 
the new formalism and, in particular, how to put its symbols or mathemat¬ 
ical expressions into correspondence with the concepts of classical phys¬ 
ics—for only in this way could operationally meaningful correlations be 
established. 

The urgency of these problems became particularly evident when 
Schrodinger visited Copenhagen in September, 1926, and lectured on 
wave mechanics at Bohr’s Institute. With his characteristic insistence on 
an all-pervasive continuity in microphysics he attacked Bohr’s views on 
discontinuity and his basic conception of quantum jumps. In spite of pro¬ 
longed discussions—some debates began in the morning and ended at 
night—no agreement between the disputing sides could be reached. 2 Fin¬ 
ally, when Bohr referred even to Einstein’s paper 3 of 1916 (or 1917) on 
the transition probabilities as supporting his view that without such dis¬ 
continuities Planck’s radiation law could not be accounted for, Schro¬ 
dinger reportedly 4 exclaimed: “If one has to stick to this damned quantum 
jumping, then I regret having ever been involved in this thing”; where¬ 
upon Bohr replied: “But we others are very grateful to you that you were, 
since your work did so much to promote this theory.” The Bohr-Schro- 
dinger clash of opinion was highly provocative and stimulated animated 
discussions in Copenhagen even long after Schrodinger’s departure. 5 

Heisenberg realized that the root of this conflict was the lack of a 
definite interpretation of the quantum-mechanical formalism. Recogniz¬ 
ing on grounds similar to those just mentioned that classical notions like 
“position” or “velocity” cannot be employed in microphysics in the same 
sense as they were in macrophysics, Heisenberg compared the situation in 
quantum mechanics with that which would have prevailed in relativity if 
the formalism of the Lorentz transformations had been combined with a 
language based on the notions of space and time in their prerelativistic 
meaning. In fact, Einstein’s reinterpretation of the concepts of space and 
time, by which the Lorentz transformations 6 were given a consistent oper¬ 
ational meaning, served Heisenberg as an example of how to overcome the 
present difficulties in quantum mechanics. Just as Einstein reversed the 
question and—instead of asking how nature can be described by a math¬ 
ematical scheme—postulated that nature always works so that the math¬ 
ematical formalism can be applied to it, Heisenberg asked himself: “Well, 
is it not so that I find in nature situations which can be described by 
quantum mechanics?” 7 “Then I asked,” Heisenberg continued, “ ‘What 
are those situations which you can define?’... Then I found very soon that 
these are situations in which there was this Uncertainty Relation between 
p and q. Then I tried to say ‘Well, let us assume there is only this possibility 
of having A/? Aq^h /hr. Does this make a consistent statement? Can I 
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then prove that my experiments never give anything different?’ ” That 
Heisenberg’s retrospective reflections on the conceptual background of 
his discovery of the uncertainly principle are correct and are not projec¬ 
tions of later ideas, and that his famous thought experiments for the deter¬ 
mination of the position of an electron played, indeed, the same method¬ 
ological role as Einstein’s operational definition of the simultaneity of 
spatially separated events, will be fully corroborated by our analysis of 
Heisenberg’s paper. 

For a full comprehension of Heisenberg’s approach it is necessary to 
understand his assessment of the conceptual situation at the end of 1926. 
The formalism of quantum mechanics, he reasoned, which operates in 
abstract multidimensional spaces and employs noncommutative quanti¬ 
ties does not admit of ordinary space-time descriptions or causal connec¬ 
tions of physical phenomena. And yet, he pointed out, the intuitive space- 
time descriptions had been carried over from classical physics into 
quantum mechanics and were being applied indiscriminately. Heisenberg 
saw in this disparity between formalism and intuitive conceptions the root 
of serious difficulties and contradictions. Realizing, however, that it 
would be impossible to construct an independent appropriate conceptual 
apparatus (in the sense of a descriptive language) which would give an 
adequate intuitive interpretation of the abstract formalism, since all our 
conceptualizations are inseparably connected with spatiotemporal ideas, 
Heisenberg saw no other alternative than to retain the classical intuitive 
notions but to restrict their applicability. 

In a letter to Pauli on October 28, 1926, Heisenberg had already writ¬ 
ten that it makes no sense to speak of a monochromatic wave at a definite 
instant or extremely short interval and that, as the basic commutation 
relation clearly shows, it is “meaningless to speak of the place of a particle 
with a definite velocity.” 8 “But if one does not take it too seriously with 
the accuracy in using the notions of velocity and position, then it may well 
make sense.” 9 Four months later, after much meditation on these prob¬ 
lems, Heisenberg arrived at certain conclusions which he communicated 
to Pauli in a 14-page letter, dated February 23, 1927. Encouraged by 
Pauli’s enthusiastic reaction, 10 he wrote a comprehensive essay on the 
contents of this letter, sent it to Pauli for approval, and afterward present¬ 
ed it to Bohr. Bohr, who had just returned from a vacation in Norway, was 
not quite satisfied with the paper and suggested some modifications. But 
Heisenberg insisted on his original version and referred to Bohr’s sugges¬ 
tions only in a postscript to his paper. Finally, at the end of March, he 
submitted his paper on the intuitive contents of the quantum-theoretic 
kinematics and mechanics, 11 in which he advanced his famous uncertain¬ 
ty principle, to the editor of the Zeitschrift fur Physik. 
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The uncertainty principle or, as it is also called, the indeterminacy 
principle had its origin in the Dirac-Jordan transformation theory. In fact, 
some of its qualitative aspects had already been announced by the propon¬ 
ents of this theory. Thus, in the introduction to his formulation of this 
theory, Dirac , 12 referring to conjugate variables, declared: “One cannot 
answer any question on the quantum theory, which refers to numerical 
values for both the q and the p. One would expect, however, to be able to 
answer questions in which only the q or only the p are given numerical 
values, or, more generally, when any set of constants of integration £ that 
commute with one another are given numerical values.” And Jordan , 13 it 
will be recalled, arrived at a similar conclusion: “For a given value of q all 
values of p are equally possible.” 

Whereas Dirac and Jordan were fully aware, as these statements indi¬ 
cate, that in quantum mechanics, contrary to classical physics, assign¬ 
ments of sharp values both of q and of p are incompatible with each other, 
Heisenberg investigated the quantitative relation between the theoretical¬ 
ly permissible distributions of such values. In other words, he inquired 
what information can be derived from the transformation theory on the 
relation between the statistical distribution of q values and that of p values. 

To answer this question he assumed that the position coordinate q has a 
distribution around q' with an uncertainty Sq = q x the latter being defined 
as that distance from q' at which the squared probability amplitude 
| S(y,q) | 2 , for a fixed parameter 77 which need not be further character¬ 
ized, falls toe -1 of its maximum height. Hence, assuming the dependence 
on q to be that of a Gaussian error curve, Heisenberg could write 


S(y,q) 1 2 = const exp 





and for S(rj,q) itself 

S(y>q) = const exp 
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he replaced, following Jordan , 14 S(q,p) by exp(2mpq/h) so that 
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where txp(2mp'q’/h) has been absorbed into the new constant. By writ¬ 
ing 
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S(rj,p) = const exp 
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the integral, extended from — oo to + oo, being a constant. With p x 
defined by the equation 
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he could write 
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which by comparison with the first equation of this derivation showed 
that p x has the same significance for p as q x had for q . Thus, Heisenberg 
concluded, the product of the uncertainties Sq and Sp is h /2i r, or 

Sq 8p — . (7.1) 

Itt 

“The more accurately the position is determined, the less accurately the 
momentum is known and conversely.” 15 In this conclusion Heisenberg 
saw the “direct intuitive interpretation” (“direkte anschauliche Erlaute- 
rung”) of the basic commutation relation pq — qp = h /2m. 

Heisenberg now asked himself whether this reciprocal limitation of 
precisions is merely a restriction imposed by the mathematical formalism 
of the transformation theory or whether it is a reflection of a more pro¬ 
found state of affairs. More precisely, he asked “whether it is not possible 
by a more profound analysis of the kinematical and mechanical concep¬ 
tions to resolve the contradictions...and to obtain an intuitive understand¬ 
ing of the quantum-mechanical relationships.” 16 

Heisenberg was hoping that an analysis of the notions of position and 
velocity, or rather their reinterpretation, would do for quantum mechan¬ 
ics what Einstein’s famous analysis of the concept of simultaneity did for 
the mechanics of high-speed motions; and just as Einstein resolved there¬ 
by the contradictions in prerelativistic physics so, Heisenberg believed, his 
analysis would remove the difficulties of atomic physics. Starting his anal¬ 
ysis with the notion of position, Heisenberg declared: “If one wants to 
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clarify what is meant by ‘position of an object’ (‘Ort des Gegenstandes’), 
for example, of an electron”—and he added ‘‘relatively to a given refer¬ 
ence system”—‘‘he has to describe an experiment by which the ‘position 
of an electron’ can be measured; otherwise this term has no meaning at 
all.” 17 Heisenberg now outlined his well-known thought experiment with 
the y -ray microscope, an idea which had occupied his mind as early as 
1924, that is, well before he discovered matrix mechanics, as he once 
confided to one of his pupils. 18 At an early date he had also discussed such 
a possibility with Borchert Drude, who like his father, Paul Drude, spe¬ 
cialized in optics. 19 It is also interesting to note in this context that the 
problem of the resolving power of the microscope was one of the questions 
(the others were on the resolving power of the Fabry-Perot plates and on 
the theory of the storage battery) which Wien had asked but Heisenberg 
failed to answer in his oral Ph.D. examination at the University of Munich 
in 1923. It seems that Heisenberg was conscientious enough to study these 
subjects after the examination. He himself said: ‘‘One might even say that 
in this later work on the y -ray microscope and the uncertainty relation I 
used the knowledge I had acquired by this poor examination.” 20 Still, in 
his discussion of the experiment he did not take into account the angular 
aperture of the objective, an omission which Bohr soon corrected. 

In principle, Heisenberg contended, one can determine the position or 
coordinate q to any desired degree of accuracy. One has only to illuminate 
the electron with radiation of sufficiently short wavelength and to observe 
it under the microscope. Owing to the Compton effect, however, the im¬ 
pact of the light quantum upon the electron changes the latter’s momen¬ 
tum abruptly, ‘‘this change being the greater, the smaller the wavelength 
of the light used, that is, the more accurately the position is determined.” 21 
Heisenberg’s omission of the angular aperture of the microscope lens 
made his description of the experiment somewhat vague as to the ultimate 
reason of the uncertainty in momentum. The reason why on the basis of 
the theory of the Compton effect the change in momentum cannot be 
sharply determined is, of course, that the direction of the scattered photon 
cannot be determined within the bundle of rays entering the microscope. 

Analyzing a Stern-Gerlach experiment, Heisenberg showed in a simi¬ 
lar vein that the precision in the measurement of energy is smaller, the 
shorter the time spent by the atom in crossing the deviating field, or 

8E8t~h (7.2) 

In view of these results Heisenberg could declare: ‘‘All the concepts 
that are used in the classical theory for the description of a mechanical 
system can also be defined exactly for atomic processes. But the experi¬ 
ments which lead to such definitions carry with them an uncertainty if 
they involve the simultaneous determination of two canonically conjugate 
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quantities.” 22 In fact, it is precisely this uncertainty, as Heisenberg point¬ 
ed out, which makes the use of classical notions admissible. 

Even a notion like “path” or “trajectory” (“Bahn”), which presup¬ 
poses a sharp knowledge of both position and momentum, can be retained 
in quantum mechanics, contended Heisenberg. For “the path comes into 
existence only when we observe it,” 23 he declared. This contention, one of 
the most provocative statements ever made in physics, was justified by 
him as follows. If an atom which, say, is in its 1000th excited state is 
illuminated by light of relatively long waves—this suffices because the 
dimensions under discussion are large—the electron, owing to the Comp¬ 
ton effect, may be found after the interaction somewhere between the 
950th and the 1050th state, the particular orbit being indeterminate. The 
electron will therefore be represented by a wave packet in configuration 
space, composed of the eigenfunctions of the states just mentioned, and of 
a size determined by the precision of the position measurement, that is, by 
the wavelength of the illuminating light. The packet describes an orbit 
analogous to that of a classical particle, but spreads with time. Every new 
observation by which a certain q is selected out of a multitude of possibili¬ 
ties reduces the otherwise spreading wave packet again to the dimensions 
of the wavelength of the incident light. The constant replacement in each 
consecutive observation of the wave packet, which has spread out, by a 
smaller packet gives rise to the orbit as the temporal sequence of the 
locations where the wave packet has been observed. 

From these results Heisenberg drew a conclusion of far-reaching philo¬ 
sophical implications at the end of his paper: “We have not assumed that 
the quantum theory, unlike classical physics, is essentially a statistical 
theory in the sense that from exact data only statistical conclusions can be 
inferred. For such an assumption is refuted, for example, by the well- 
known experiments by Geiger and Bothe. However, in the strong formu¬ 
lation of the causal law ‘If we know exactly the present, we can predict the 
future’ it is not the conclusion but rather the premise which is false. We 
cannot know, as a matter of principle, the present in all its details.” 24 And 
Heisenberg concluded: “In view of the intimate connection between the 
statistical character of the quantum theory and the imprecision of all 
perception (‘Ungenauigkeit aller Wahrnehmung’) it may be suggested 
that behind the statistical universe of perception there lies hidden a ‘real’ 
world (‘eine “wirkliche” Welt’) ruled by causality. Such speculations 
seem to us—and this we stress with emphasis—useless and meaningless 
(‘unfruchtbar und sinnlos’). For physics has to confine itself to the formal 
description of the relations among perceptions.” 25 

For a full comprehension of Heisenberg’s reasoning the following re¬ 
marks should be noted. The purely mathematical result Sq Sp = h /2i r [ or 
formulas (7.3) and (7.4), which follow], that is. the fact that the product 
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of the errors or standard deviations of the variables is equal to, or exceeds, 
a certain positive number, does not in itself entail that these variables 
cannot be simultaneously measured with perfect precision—as examples 
of two random variables defined on sets of macroscopic objects, such as 
the height and the weight of the members of a given population, clearly 
show. Although suggested by two important ideas—namely, (1) the tacit 
assumption of the perfect similarity (or in a more modem terminology 
“identity of states of preparation” as in a pure case) of the particles under 
discussion and (2) the recognition of an unavoidable interaction between 
the object and the measuring apparatus—Heisenberg’s identification of 
formula (7.1) with an uncertainty relation, as he understood it, had not 
the character of a logically necessary conclusion. Ultimately, like every 
interpretation of a mathematical relation, it was an act of associating 
symbols with operations and thus contained a certain element of arbitrari¬ 
ness. 

One may be tempted to relate this arbitrariness to the fact that in the 
years 1925 and 1926 questions concerning the limitations of the sensibility 
of measuring instruments were much discussed. In 1925 Moll and Burger 
constructed their thermal relais technique, 26 which proved very useful for 
the microphotometry in experimental spectroscopy; discussing the sensi¬ 
bility of galvanometers, they declared: “By the application of this tech¬ 
nique the sensibility of galvanometers can be increased indefinitely.” 27 At 
the same time, however, the disturbing effects of Brownian fluctuations 28 
were already fully recognized. 29 In fact, in the summer of 1926, F. Zer- 
nike 30 gave a series of guest lectures on this subject on Gottingen Universi¬ 
ty, and these lectures received a considerable amount of attention. 31 We 
must emphasize, however, that we have found no evidence of any direct or 
indirect influence of these lectures or discussions upon Heisenberg. 

Heisenberg’s omission of taking into account the angular aperture of 
the microscope gave rise to an often repeated misconception. Although 
his description of the experiment mentioned the deflection, of the reflected 
light quantum by the lens of the microscope, it did not emphasize the 
indeterminacy of the direction of its reflection but put the stress instead 
upon the discontinuous change of the momentum of the electron. This 
was somewhat misleading, for it suggested that this discontinuous mo¬ 
mentum change, as an uncontrollable interaction between the object (the 
electron) and the measuring device (the incident radiation), produces the 
uncertainty. It should be understood, however, that in the Compton ef¬ 
fect, as the Bothe-Geiger and Compton-Simon experiments have shown, 
the discontinuous change of momentum can be rigorously calculated, 
provided the direction of the reflected quantum is accurately known. 
Now, only because this direction cannot be ascertained more accurately 
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than up to the finite angle of the aperture may one speak of an uncontrolla¬ 
ble interaction. 

At the time of the publication of Heisenberg’s paper, the philosophical 
problems which it raised were not immediately apprehended—problems 
such as whether the uncertainties are of an ontological nature (that is, 
whether the particle just does not have a determinate position or velocity) 
or whether they are only of an epistemological nature (that is, whether 
they are due to the crudeness of the measuring process or, in any case, exist 
only in our knowledge). From Heisenberg’s remarks concerning the no¬ 
tion of “path” or “trajectory” it may be inferred that he would have 
favored the first alternative. At the present stage philosophical problems 
of this kind were not yet so much in the limelight of interest as were the 
methodological questions concerning the definability and measurability 
of the concepts of quantum mechanics. Heisenberg’s basic approach, 
viewed from this angle, was at that time to regard something as defined if 
it is measurable. In short, definability was based on, or reduced to, mea¬ 
surability. Furthermore, according to Heisenberg’s paper, light was fo¬ 
cused not only in a measurement, but also on measurement as such: it 
directed the attention of atomic physics to the importance of the concept 
of measurement which from now on was to assume a central position in all 
questions relating to the foundations of quantum mechanics. Finally, our 
assessment of Heisenberg’s paper would not be complete if we omitted 
mentioning that his proposed solution of the problem of causality, his idea 
that the unascertainability of exact initial values obstructs predictability 
and, consequently, deprives causality of any operational meaning, was an 
outstanding contribution to modern philosopy. In fact, modern philoso¬ 
phy, as Schlick 32 later admitted, was taken by surprise since even the mere 
possibility of such a solution had never been anticipated in spite of the 
profusion of discussions on this problem for generations. 

It took some time until philosophy fully appreciated the implications of 
Heisenberg’s paper for the problem of causality. In fact, even its impact on 
physics was at first not generally recognized, although the paper, unlike 
the one on matrix mechanics of July, 1925, 33 was reviewed immediately 
after its publication in the Physikalische Berichte 34 and somewhat later in 
Science Abstracts, 35 Apart from Bohr and Pauli, who, as we know, were 
intimately acquainted with Heisenberg’s ideas, only Kennard, then visit¬ 
ing Copenhagen, fully understood from the beginning the far-reaching 
significance of the uncertainty relations. Already in July, 1927, Kennard, 
in his previously mentioned review article, 36 called the relations “the core 
of the new theory” (“der eigen tliche Kern der neuen Theorie”) . 37 He also 
generalized Heisenberg’s derivation to the case of any two canonically 
conjugate variables and pointed out that the particular choice of the Gaus¬ 
sian probability distribution gives the optimal limit of precision. In addi- 
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tion, he declared that one can no longer speak, as in classical physics, of 
the true value of a physical quantity since what hitherto was regarded as 
an “error of observation” has now become an integral part of the theory. 
In his well-known encyclopedia article 38 Pauli began his exposition of the 
general principles of quantum theory with the statement of the uncertain¬ 
ty relations. There are indications that it was also due to Pauli that Her¬ 
mann Weyl’s Theory of Groups and Quantum Mechanics 39 was the first 
treatise on quantum mechanics which assigned to the uncertainty rela¬ 
tions—which Weyl on a suggestion by Pauli derived on the basis of the 
Schwarz inequality—an integral part in the logical structure of the whole 
theory. C. G. Darwin was probably the first to recognize the connection 
between the uncertainty relations and the inversion of the Fourier inte¬ 
gral, as his paper “The electron as a vector wave” 40 discloses. 

As may have been expected, the uncertainty principle was regarded in 
certain quarters as a challenge to experimental ingenuity, and numerous 
thought experiments were devised to disprove it. Thus, for example, at the 
Nashville meeting of the American Physical Society on December 30, 

1927, Ruark 41 suggested the following idealized experiment. Two slits, a 
great distance d apart, are equipped with movable shutters. At the time t x 
the first slit is opened for an interval r and a freely moving particle is made 
to pass through it. If it is found that the particle moves at such a rate that it 
passes also through the second slit, which is opened at the time t 2 likewise 
for an interval r, then, contended Ruark, since d and t 2 — t x can be deter¬ 
mined with an arbitary precision, and since r may be taken sufficiently 
minute, a simple calculation would show that Heisenberg’s uncertainty 
relation is violated. This, however, Ruark declared, is not the case; Hei¬ 
senberg’s “indetermination principle”—he was the first to call it a “prin¬ 
ciple”—remains valid because “the measuring devices are statistical ag¬ 
gregates of atoms and are themselves subject to fluctuations.” A few 
weeks later, at the New York meeting of the Society on February 24—25, 

1928, Ruark 42 revised his argumentation and declared: “The velocity of 
the particle changes when it passes the first slit, for it may be considered as 
a group of waves and the frequency of each harmonic train in the group is 
changed by the modulation due to the shutter. This involves a change of 
energy, that is, a modified velocity.” 

Stimulated by Ruark’s ideas, Kennard, 43 having returned at the end of 
1927 from Copenhagen to Cornell, studied in great detail how what he 
called the “shutter effect” changes the speed of an electron and concluded 
in analogy with the Fourier resolution of a finite wave train that this 
velocity is predictable only in a statistical sense. It is interesting to note 
that neither Ruark nor Kennard recognized in this context that the uncer¬ 
tainty principle does not refer to the past. In fact, from the idealized 
experiments which they considered it could easily be inferred that if the 
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position of an electron is precisely measured, after an exact measurement 
of its velocity, the product Sp Sq has a value much smaller than h. But this 
value, as Heisenberg 44 soon pointed out in his Chicago lectures, “can 
never be used as an initial condition in any calculation of the future prog¬ 
ress of the electron and thus cannot be subjected to experimental verifica¬ 
tion. It is a matter of personal belief,” he declared, “whether such a calcu¬ 
lation concerning the past history of the electron can be ascribed any 
physical reality or not.” 45 There was, however, one important result to 
which Ruark was led by his study of these thought experiments. Heisen¬ 
berg, it will be recalled, declared that a dynamical variable such as the 
position of a particle can be measured, in principle at least, to any desired 
degree of accuracy although at the price of the accuracy of the simulta¬ 
neous measurement of its conjugate variable. Ruark, 46 now, was one of the 
first to challenge this statement. He pointed out, for example, 47 that in the 
y -ray experiment the wavelength of the incident radiation cannot be made 
smaller than the increase of wavelength due to the Compton effect which, 
for a scattering angle of 90°, is h /me. Similar results, though on the basis of 
relativistic considerations, were obtained also by Flint and Richardson. 48 

While Ruark and Kennard discussed primarily the experimental 
aspects of Heisenberg’s principle, Condon and Robertson studied its 
mathematical relation to the formalism of quantum mechanics. In his 
effort to find a general formulation of the principle which was assumed to 
hold for any two noncommutative operators, Condon 49 encountered the 
following difficulties: (1) the noncommutativity of two operators^ and B 
does not imply that the product of their corresponding uncertainties 
A A -A B f which Condon, following Weyl, defined by the variance (or the 
square of the standard derivation) in accordance with the formula 


(A A) 2 



must be greater than, or equal to, some positive lower limit; (2) even if A 
and B do not commute, certain values of the quantities represented by 
these operators may both be known precisely; (3) there exist states with 
regard to which A and B commute although the values of their corre¬ 
sponding quantities cannot be known with unlimited precision. To sub¬ 
stantiate (1) Condon considered A L zf the uncertainty of the z compo¬ 
nent of the angular momentum, for the hydrogen eigenfunction xp nlm 
= R(r) Xexp(im<p)P™(cos 0); since L z = mfi, he concluded that 
A L x A L z = 0 in spite of the fact that L X L Z — L Z L X ^0. As to (2) he 
chose the state function so that L x = L y = L z = 0 and their uncer¬ 
tainties vanish. Finally, as to (3) he pointed out that for tf; nl0 (for which 
L z = 0) L x L y — L y L x = ifiL z implies that L x and L y commute although 
A L x 9^0 and A L v ^0. 
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A few weeks later Robertson, a colleague of Condon at Princeton, 
clarified the situation. He showed 50 that the uncertainty principle can be 
generally formulated for any two Hermitian operators if in the formula¬ 
tion reference is made to the wave function of the state under considera¬ 
tion. For he proved that for a given normalized function if/ 0 and two Her¬ 
mitian operators A and B 

k All. (7.3) 

47 T 

where C = (2m/h)(AB — BA), C 0 = S ip*Cip 0 dr, (A A) 2 = S ip* 
(A — A 0 ) 2 ip 0 dr, A 0 = f ipoAtf; 0 dr, etc., or in a slightly modernized no¬ 
tation 51 

(^A)(^B)>\\(D)^\ (7.3') 

where iD = AB — BA. For the special case A = p and B = q, Robertson 
concluded, C = C 0 = 1 and 

AgA/?> (7.4) 

4 IT 

in full agreement with WeyFs result. Ditchburn 52 soon clarified complete¬ 
ly the relation of the inequality (7.4), which he called “Pauli’s inequali¬ 
ty,’’ with Heisenberg’s original “equality” (7.1) by pointing out that in 

accordance with the general theory of errors A q = 8q/fi and similarly for 
p and by proving rigorously (with the assistance of J. L. Synge) that the 
limiting condition of equality in (7.4) is obtained if and only if the scatter 
is a Gaussian distribution. 

Schrodinger, as we know from his correspondence with Bohr 53 and 
Einstein, 54 examined very carefully various implications of the uncertain¬ 
ty principle, such as its relation to the discernibility of the discrete energy 
levels of the atoms of an ideal gas in an enclosure. 55 When in the spring of 
1930 he was studying the problem of how to distribute, in an optimal 
simultaneous measurement of p 0 and q 0 at an instant t 0 , the unavoidable 
uncertainty h /\rr between the two variables in such a way that at a given 
later instant t the uncertainty A q in position will be minimal, Sommerfeld 
drew his attention to the papers by Condon and Robertson. Schrodinger 
immediately realized that their result could be improved. For he showed 56 
that for any two Hermitian operators A and B in any state if) 


( AA) 2 {AB) 2 > (- 


An + BA 


-AB + 


Kl-o) 


where A = f if>*Aif> dr, etc. Although the first additive squared term on 
the right-hand side of (7.5) may vanish in particular cases, as in the case 
of all optimal simultaneous measurements of canonically conjugate vari- 
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ables, Schrodinger’s formula (7.5) provided, in general, for the lower 
limit of the product of the uncertainties a value greater than previously 
obtained. 

In the early thirties, the far-reaching implications of Heisenberg’s prin¬ 
ciple for physics as well as for epistemology were fully recognized, and the 
uncertainty principle became an issue of numerous discussions both by 
philosophically minded physicists and by philosophers interested in the 
foundations of science. 563 C. G. Darwin called it a “great achievement,” 57 
whereas von Laue 58 refused to accept it as a final barrier in the search for 
deeper causes; K. R. Popper 59 charged Heisenberg with having tried “to 
give a causal explanation why causal explanations are impossible.” Some 
even saw in Heisenberg’s indeterminacy principle a way to solve the long¬ 
standing philosophical conflict between the doctrines of free will and de¬ 
terminism. “If the atom has indeterminacy,” argued Eddington, 60 “surely 
the human mind will have an equal indeterminacy; for we can scarcely 
accept a theory which makes out the mind to be more mechanistic than 
the atom.” Lucretius’ theory of the exiguum clinamen principiorum , 61 his 
doctrine that free will in man is made possible by the “minute swerving of 
the elements, ” enjoyed an unexpected revival on the basis of modem 
quantum mechanics. 

To discuss these questions in detail would lead us too far from our 
subject. But we wish to point out that the uncertainty principle still con¬ 
tinues to play an important role in current research on the foundations of 
quantum mechanics. Thus, to mention only one example, a rigorous refor¬ 
mulation of quantum mechanics as a general statistical dynamical theory, 
that is, as a theory in which also the transformation with time of the 
probability distribution is a purely stochastic process, 62 leads, within the 
framework of ordinary probability concepts, as Suppes 63 has recently 
shown, to the conclusion that a genuine joint probability distribution for 
two variables, such as position and momentum, does not always exist. In 
other words, “not only are position and momentum not precisely measur¬ 
able simultaneously, they are not simultaneously measurable at all.” 633 

Having discussed the development of Heisenberg’s principle and its 
implications concerning the notion of causality, as conceived by the physi¬ 
cists at that time, we are now in a position to discuss also its implications 
with regard to the notions of identity, indistinguishability, and identifiabi- 
lity of physical objects, notions which came to the forefront of scientific 
interest as soon as the empirical spectra of polyelectronic atoms became 
the subject of quantum-mechanical investigations. 

Our remarks 64 on Pauli’s exclusion principle with which our topic is 
intimately related apply also at present: the subject, bearing upon physics 
and philosophy alike, seems to have not yet been fully explored from the 
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epistemological and logical point of view. Since the revision of these no¬ 
tions which played an important role in the establishment of quantum 
statistics led to far-reaching philosophical conclusions, a brief digression 
on their conceptual development, as far as nonrelativistic quantum me¬ 
chanics is concerned, seems appropriate. 

Let us then start with the notion of “identical” or, as we prefer to call 
them in this section, “like” 643 particles, that is, particles exactly equal in 
all their qualities or physical specifications. The problem whether such 
like entities really exist in nature has a long history of its own. Most 
philosophies of nature which, like the Pythagorean doctrine of atoms, 
based their systems on conceptions of combinatorial discreteness affirmed 
the idea whereas those schools of thought which espoused the principles 
of continuity and determinism as their theoretical foundations denied it. 
Whether Democritus, according to whom “atoms differ in shape and 
size,” taught that atoms of one kind are strictly alike, as so often alleged, 
seems not to stand the test of documentary evidence. Aristotle’s descrip¬ 
tion of Democritean atoms, for example, as having “all of them the same 
nature just as if each one separately were a piece of gold, 646 does not decide 
the issue. 

There is no doubt, however, that the Stoics, with their belief in continu¬ 
ity and determinism, categorically denied the existence of like entities. In 
their view two physical objects, however indistinguishable they may ap¬ 
pear, must intrinsically differ from each other, for otherwise no sufficient 
reason would explain the difference of their positions in space. Thus, for 
example, Cicero, who became acquainted with the ontological realism of 
the Stoics through his contact with Diodotus, Antichus of Ascalon, and 
Posidonius, quoted them as saying that “no hair or no grain of sand is in all 
respects the same as another hair or grain of sand.’ ,64c Lucius Annaeus 
Seneca, the leader of Roman Stoicism, wrote in a similar vein: “...amid all 
this abundance there is no repetition; even seemingly similar things are, on 
comparison, unlike. God has created all the great number of leaves that 
we behold: each, however, is stamped with its special pattern. All the 
many animals, none resembles another in size—always some difference! 
The Creator has set himself the task of making unlike and unequal things 
that are different.” 64 " 1 Stating that the Academics “confuse things indis¬ 
tinguishable by their attributing one and the same quality to two 
substances,” 64 * Plutarch introduced the term “indistinguishability” 
(dirapaAAagia) into philosophical discourse, a term which already ap¬ 
peared in the writings of Philodemus of Gedara. 6 ^ Plotinus, 64 ® the 
founder of the Neoplatonic school in Rome, Athanasius 6411 in his polemics 
against the Arian doctrine of “homoousia” (coequality of Father and 
Son), Giordano Bruno, 641 Nicholas Malebranche, 6 ^ and many more 
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thinkers of the past concurred with the Stoics on this issue, though on 
different grounds. 

The most pregnant formulation of this viewpoint, based on an argu¬ 
ment most akin to that of the Stoa, 641 * was given by Leibniz in his princi- 
pium identitatis indiscernibilium : “no two objects in nature can differ only 
in number .” 641 In his fourth letter to Samuel Clarke 64 " 1 Leibniz declared: 
“There is no such thing as two individuals indiscernible from each other.” 
For otherwise, he argued, no sufficient reason could account for the fact 
that object A , is in place a , and object A 2 in a 2 rather than A , in a 2 and A 2 
in a v 

To discuss in detail the impact of Leibniz’s principle of the identity of 
indiscemibles upon later thought would lead us too far astray. To Kant’s 
criticism of the principle as “spurious” 64 " Salomon Maimon took excep¬ 
tion, arguing that “the difference in external temporal-spatial relations 
has its origin in the difference of internal qualities....Two drops of water 
could not appear in two places unless they differed in intrinsic qualities; it 
is only the result of the incompleteness of our conceptions that we do not 
realize this difference .” 640 For more recent reactions we refer the reader to 
the literature on this subject . 6415 

Turning now to those schools of thought which affirmed the existence 
of like physical entities, we find that this view has only rarely been explicit¬ 
ly professed. This, of course, is not surprising if we recall that for many 
centuries Aristotelianism dominated scientific thought. In fact, prior to 
the revival of the atomistic doctrine in the seventeenth century the fore¬ 
most proponents of this view were the Mutakallimun, the philosophers of 
the Kalam, 6 ^ whose atomistic doctrine renounced causal determinism 
and adopted instead a transcendental principle of constant divine interfer¬ 
ence. Their conception of atoms, as described by Moses Maimonides, one 
of the principal sources for our knowledge of the Kalam, was: “All these 
atoms are perfectly alike; they do not differ from each other in any 
point. ,,64r John Dalton, the originator of modern atomism in the early 
nineteenth century, used almost the same words when he said that “the 
ultimate particles of all homogeneous bodies are perfectly alike .” 648 

By ascribing to all atoms of the same chemical element the same quali¬ 
ties, modern classical physics rejected Leibniz’s principle as we see. In 
striking contrast to Maimon’s conception, local position was no longer 
regarded as a clue to qualitative differentiation, an idea which was strong¬ 
ly supported by the theory of relativity. “An atom of oxygen remains the 
same whether it descends in the brook into the valley or rests in the lake, 
whether it rises in the vapor or drifts along in the air, whether it is inhaled 
in the breath or is absorbed in the blood: and in each of these locations it 
can be replaced by every other atom of its kind,” declared Windelband in 
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1910 in a statement characteristic of the conceptual situation of that 
time. 64 * 

The atomic theory of the nineteenth and early twentieth centuries ac¬ 
cepted, as we see, the qualitative identity of particles—but denied their 
indistinguishability! If with the Stoa and Leibniz determinism, in its logi¬ 
cal attire as the principle of sufficient reason, led to the denial of qualita¬ 
tive identity, in modem classical atomism determinism, in its physical 
attire as the laws of motion, led to the denial of indistinguishability. For it 
was claimed that two particles, once “told apart,” can always be “told 
apart,” for they can always be reidentified, thanks to the uniqueness of the 
solutions of the equations of motion. If the motion of each particle is 
represented by its four-dimensional world-line in space-time, a family of 
nonintersecting lines is obtained each of which can be used to characterize 
or label uniquely the particle whose trajectory it is. It was therefore 
thought it is always possible, at least in principle, to decide unambiguous¬ 
ly whether or not a particle found at a certain time at a certain place is the 
same as a particle found at another time at the same or another place. In 
short, particles could be “alike,” but were never “indistinguishable.” 
Even in the absence of an intrinsic difference of quality the extrinsic differ¬ 
ence of kinematical behavior was a sufficient criterion for their identifiabi- 
lity; it derived its operational meaning from the assumption of an unres¬ 
tricted possibility of establishing an unbroken connection between the 
object at a time t x with the object at a time t 2 “by continuous observation 
(either direct or indirect) through all intermediate time. ,,64u 

Quantum mechanics, as we now shall see, led to a far-reaching revision 
on this point. In the light of our preceding remarks it is interesting to note 
that Heisenberg, one of the first quantum physicists to question the doc¬ 
trine of determinism, was also the first to recognize the importance of the 
notion of like particles for the new scheme of conceptions. Already in the 
spring of 1926 he wrote: “It is a characteristic trait of atomic systems that 
their constituent parts, the electrons, are equal and subject to equal 
forces. ,,64v To show how to express this “trait” in exact mathematical 
terms he formulated the Hamiltonian for the system under discussion (a 
classical system of two coupled oscillators) as a symmetric function of th§ 
coordinates and parameters (mass, frequency) of the constituent 
parts. 64w 

At about the same time Dirac 64 " 1 proposed a new quantum-mechanical 
approach to the statistical treatment of a system of like particles, though 
as yet without taking account of the spin, and Fermi 64 *' arrived indepen¬ 
dently at similar results. The Fermi-Dirac statistics, as is well known, 
allowed only for antisymmetric functions, in contrast to the Bose-Einstein 
statistics 642 for like particles and the classical Boltzmann statistics for 
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distinguishable particles. In November, 1926, Heisenberg 65 was already 
in a position to apply these ideas to the helium atom with the inclusion of 
spin. At the same time Fowler 65a reexamined the whole problem ab initio. 
In the attempt to arrive at “a quite general form of statistical mechanics of 
which the classical form and Einstein’s and Fermi-Dirac’s are special 
cases,” Fowler defined like—or, as he called them, “identical”—particles 
as “having the same set of energy values.” 656 Having thus obtained the 
status of a fundamental notion in the new scheme of ideas, the conception 
of like articles was soon fully vindicated by Heisenberg, 65c Hund, 65d and 
Dennison. 656 Through his correct interpretation of the change in the rota¬ 
tional specific heat of the hydrogen molecule. Dennison could show that 
protons, just like electrons, are subject to the Fermi-Dirac statistics or the 
exclusion principle and have the spin Wigner 65f meanwhile arrived at a 
successful approach to the general case of N like particles and its relations 
to group-theoretical considerations, a relation—first suggested to him by 
von Neumann—which since then became one of the most important con¬ 
ceptual instruments for the study of elementary particles. 

In the summer of 1927 the importance of the ideas of exchange degen¬ 
eracy and exchange energy for our understanding of atomic spectra was 
already clarified. In June, Heitler and London 658 published their famous 
paper on the hydrogen molecule in which they showed the existence of a 
new kind of saturable, nondynamic forces, the so-called “exchange 
forces” of attraction or repulsion between like particles, and developed a 
schematic theory of the homopolar valence 656 which eventually brought 
the whole of chemistry under the sovereignty of quantum mechanics. 

These results not only lent weight to the concept of like particles; they 
also showed that like particles may be indistinguishable, that is, may lose 
their identity, a conclusion which follows from the uncertainty relations 
or, more precisely, from the impossibility of keeping track of the individ¬ 
ual particles in the case of interactions of like particles. For, contrary to 
classical dynamics, trajectories could no longer be defined as sharp nonin¬ 
tersecting world-lines but had to be conceived as overlapping each other. 
In fact, all papers on exchange phenomena and, in particular, the calcula¬ 
tions concerning the ground state of the helium atom, in which the wave 
functions of the two electrons overlap completely, showed clearly that the 
classical principle of an unrestricted identifiability of particles had to be 
abandoned. Moreover, it was possible to show that Heisenberg’s uncer¬ 
tainty principle was already contradicted by the idea of an approximately 
continuous sequence of atomic-configuration measurements, designed to 
identify electrons in lower-energy states and hence requiring positional 
uncertainties smaller than average electron distances. In addition, the 
success of the theory of the scattering of like particles, as first suggested by 
Oppenheimer 65 ' and subsequently developed by Mott 65j with the well- 
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known interference term, characteristic for the exchange interaction and 
leading to results at variance with classical calculations, and the agree¬ 
ment with experience of such calculations concerning the effective cross 
section constituted, so to speak, an experimental verification of the unten- 
ability of the principle of unrestricted identifiability. 

It must be admitted, however, that these far-reaching conclusions, al¬ 
though implicitly contained in almost all papers on this subject, were only 
seldom explicitly stated by their authors. It was rather a topic discussed in 
the writings of the more philosophically interested physicists. Thus, 
Kellner, in his papers previously referred to, 65k characterized the new 
statistics as “precluding the reidentification (‘Wiedererkennbarkeit’) of a 
particle if once one has lost track of it”; Langevin discussed this matter in 
his address to the Paris meeting of the French Society for Physical Chem¬ 
istry in 1933; 651 and, in the English-speaking world, C. G. Darwin’s writ¬ 
ings, especially his at that time widely circulated book on the new concep¬ 
tions of matter, 65 " 1 attracted the attention of philosophers to these results. 

It should also be noted that the renunciation of identifiability or the 
assumption of total likeness of particles was not always unreservedly ac¬ 
cepted. One of the main objections, raised by logicians and physicists 
alike, was the following argument, recently restated, for example, by Ar¬ 
thur March: “In order to count them (these particles) as two or more, 
there must be something intrinsic by which they differ from each 
other.’’ 65 " Number or, more precisely, “being numerable,’’ we may an¬ 
swer, is not necessarily a property of the individuals to be counted; it is 
rather a property of the system composed by them. In fact, electrons in 
atoms are never counted, but it is shown that the mathematical conse¬ 
quences of tentatively assigning a number to the system which they com¬ 
pose either agrees or disagrees with experience. It is in this sense that 
particles, though being alike, indistinguishable, and even unidentifiable, 
may still be counted collectively. With these remarks we conclude our 
digression. 


7.2 Complementarity 

The often repeated statement that Bohr derived the notion of complemen¬ 
tarity from Heisenberg’s uncertainty relations is erroneous from both the 
historical and the conceptual points of view. Heisenberg’s uncertainty 
relations were for Bohr rather a confirmation of conceptions he had been 
groping for long before Heisenberg derived his principle from the Dirac- 
Jordan transformation theory. True, Heisenberg’s work prompted Bohr 
to give his thoughts on complementarity a consistent and final formula- 
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tion, but these thoughts, as we shall see presently, can be traced back at 
least to July, 1925. 

Although very little is known about the genesis of these ideas in Bohr’s 
mind—from July, 1925, until September, 1927, the most dramatic period 
in the development of the modern quantum theory, Bohr had published 
very little on quantum physics and still less on his more intimate philo¬ 
sophical ideas—one thing seems to be certain: Bohr’s conception of com¬ 
plementarity originated from his final acceptance of the wave-particle 
duality. His formerly inflexible opposition to Einstein’s light quanta— 
which found such an eloquent expression in his answer to Einstein’s letter 
shortly after the publication of the Bohr-Kramers-Slater paper, 66 and as a 
result of which, at least in part, he advocated the highly unconventional 
conceptions as expressed in this paper—suffered a serious blow through 
the results of the Bothe-Geiger experiments. 663 

An early evidence of Bohr’s acknowledgment of the wave-particle du¬ 
ality and of his first ideas on how to cope with it is contained in a four-page 
postscript (“Nachschrift”), dated July, 1925, to a paper 67 which treated 
the interaction between atomic systems still along the lines of the Bohr- 
Kramers-Slater approach. After having written the article (pp. 142-154), 
Bohr became acquainted with the Bothe-Geiger paper which had mean¬ 
while appeared and found in it a convincing proof that in the Compton 
scattering the emission of the recoil electron is coupled with the emission 
of the photoelectron, ejected by the scattered radiation—in blatant con¬ 
trast to the assumption of the independence of atomic transitions as pro¬ 
posed in the Bohr-Kramers-Slater paper. This assumption, Bohr now de¬ 
clared in the postscript, had to be made because no spatiotemporal 
mechanism seemed conceivable which could have admitted such a cou¬ 
pling and at the same time would have conformed to the classical electro¬ 
dynamics which had always proved itself so useful for the description of 
optical phenomena. “It must, however, be stressed,” he declared, “that 
the question of a coupling or of an independence of the individually ob¬ 
servable atomic processes cannot simply be regarded as a differentiation 
(‘Unterscheidung’) between two well-defined conceptions concerning 
the propagation of light in empty space, corresponding perhaps to a cor¬ 
puscular or undulatory theory of light. It involves rather the problem how 
far spatiotemporal pictures in terms of which physical phenomena have 
hitherto been described can be applied to atomic processes....In view of 
the recent results one should not be surprised if the required extension of 
classical electrodynamics leads to a far-reaching revolution of the concep¬ 
tions (“eine durchgreifende Revolution der Begriffe”) on which the de¬ 
scription of nature has been based so far.” 68 Bohr was convinced that the 
particle picture as suggested by the Bothe-Geiger experiments had to be 
somehow harmonized with the basic assumption of the Bohr-Kramers- 
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Slater paper, according to which the radiative aspects of atomic interac¬ 
tions have to be described by the wave picture. Facing a similar dilemma 
in his treatment of the capture of electrons by fast a particles, subject to 
the conservation of energy, Bohr declared that “we are compelled to ac¬ 
knowledge in this phenomena a new trait which is not describable in terms 
of spatiotemporal pictures.” 69 And he added that one has to be prepared 
to envisage processes “which are incompatible with the properties of me¬ 
chanical models...and which defy the use of ordinary space-time models 
just as the coupling of individual 70 processes in distant atoms defy a wave 
description of optical phenomena.” 71 

From now on, the wave-particle dilemma seems to have engaged 
Bohr’s attention constantly. In a lecture on “Atomic Theory and Wave 
Mechanics” 72 delivered before the Copenhagen Academy on December 
17, 1926, he discussed this issue at some length. After the lecture he con¬ 
fided to fMffding that he became “more and more convinced of the need 
of a symbolization if one wants to express the latest results of physics.” 73 

In the debates following Schrodinger’s visit to Copenhagen and later, 
especially in his controversy with Heisenberg on the meaning of the uncer¬ 
tainty relations, he rejected the latter’s more formal approach and regard¬ 
ed the wave-particle duality as the ultimate point of departure for an 
interpretation of the theory. He disliked Heisenberg’s idea that “nature 
imitates a mathematical scheme” 74 and his one-sided preference for the 
intuitive particle conception of classical physics whose applicability 
should be limited only as far as necessary in order to conform with the 
quantum-mechanical formalism. 

Heisenberg’s remark 75 that “Bohr developed his ideas on complemen¬ 
tarity” while vacationing in Norway early in 1927 implied that it was 
during this period that Bohr associated his general ideas on the wave- 
particle duality with what he subsequently called the quantum postulate 
according to which an essential discontinuity or not-further-analyzable 
individuality, a notion completely foreign to classical thought and symbo¬ 
lized by Planck’s quantum of action, has to be attributed to every atomic 
process. 

For Bohr, in contrast to Heisenberg, it was not the formalism as such 
but rather its underlying logic which should be given priority in the at¬ 
tempt to reach an appropriate interpretation of the theory. Thus, with 
regard to the basic equations 

E=hv and p = hk (7.6) 

which express the proportionality between energy E and momentum p 
with the frequency v and wave number k , respectively, Bohr asked himself 
how it is possible that these equations relate together characteristics of 
radiation which, strictly speaking, are contradictory to each other: par- 
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tide attributes energy and momentum, necessary for the description of the 
interaction of radiation with matter, as in the photoeffect or in the Comp¬ 
ton effect, and frequency and wave number, necessary for the description 
of the propagation of light, as in the phenomena of interference and dif¬ 
fraction. Planck’s constant, Bohr realized, establishes in these equations a 
relation between two descriptions of radiation that are mutually exclusive, 
but equally necessary. 

It was this mutual exclusion and, at the same time, indispensability of 
fundamental notions and descriptions which led Bohr to the conclusion 
that the problem with which quantum physics found itself confronted 
could not be solved by merely modifying or reinterpreting traditional 
conceptions. What was needed, he concluded, was a new logical instru¬ 
ment. He called it “complementarity,” denoting thereby the logical rela¬ 
tion between two descriptions or sets of concepts which, though mutually 
exclusive, are nevertheless both necessary for an exhaustive description of 
the situation. In Heisenberg’s reciprocal uncertainty relations he saw a 
mathematical expression which defines the extent to which complemen¬ 
tary notions may overlap, that is, may be applied simultaneously but, of 
course, not rigorously. The uncertainty relations, Bohr contended, tell us 
the price we have to pay for violating the rigorous exclusion of notions, the 
price for applying to the description of a physical phenomenon two cate¬ 
gories of notions which, strictly speaking, are contradictory to each other. 
In these relations he also saw a confirmation of his thesis that complemen¬ 
tarity would never lead to logical contradiction in spite of the fact that the 
notions involved are logically exclusive of each other; for, as the uncer¬ 
tainty principle shows, the sharp exhibition of one of such complementary 
notions necessitates an experimental setup which differs totally from that 
required for the other, it being the essence of the uncertainty principle that 
no physical situation can arise which exhibits simultaneously and rigor¬ 
ously (sharply) both complementary aspects of the phenomenon. 

The possibility of using mutually contradictory notions for the descrip¬ 
tion of the same physical situation, argued Bohr, arises from the indeter¬ 
minateness of the concept “observation.” For the interaction between the 
object of observation and the agency of observation—which interaction in 
accordance with the quantum postulate cannot be neglected as it could in 
classical physics—makes it impossible to separate sharply the behavior of 
the atomic system from the effect on the measuring instrument whose 
behavior must be expressed in classical terms. By combining the atomic 
system with different classically describable devices one may measure 
complementary variables, and by expressing the results of these measure¬ 
ments in classical terms one may describe an atomic system in terms of 
complementary classical pictures. 
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In view of our discussion on the philosphical background of nonclassi* 
cal interpretations (Sec. 4.2) it will be understood that Bohr’s conception 
of complementarity was strongly influenced both by IMffding’s philo¬ 
sophical teachings and by James’s psychological writings. In an article 
published on the occasion of Htfffding’s eighty-fifth birthday 76 Bohr ac¬ 
knowledged the impact of Htfffding’s ideas on his work and, in particular, 
on his search for an appropriate interpretation of the quantum-mechani¬ 
cal formalism. 77 

Bohr’s insistence on the indeterminateness of the concept of observa¬ 
tion, in so far as it depends upon which objects are included in the system 
to be observed and which in the agency that observes, showed a striking 
similarity to James’s analysis of the notion of observation in psychology. 
The influence of James, whose psychological writings, as we know, Bohr 
had studied with great interest, may be inferred also from the following 
fact: Bohr concluded his first paper 78 on the notion of complementarity 
with the statement that “the idea of complementarity is suited to charac¬ 
terize the situation, which bears a profound analogy to the general diffi¬ 
culty in the formation of human ideas, inherent in the distinction between 
subject and object’’—a statement which clearly referred to one of the main 
issues dealt in James’s Principles of Psychology. Similarly, in a subsequent 
paper 79 in which Bohr resumed his discussion on the impossibility of a 
strict separation of the phenomenon from the means of its observation, he 
reemphasized the analogy between physics and psychology:“Strictly 
speaking, the conscious analysis of any concept stands in a relation of 
exclusion to its immediate application,’’ 80 and he admitted explicitly: 
“The necessity of taking recourse to a complementary, or reciprocal, 
mode of description is perhaps familiar to us from psychological prob¬ 
lems.” 81 Even an allusion to the “Stream of Thought,” 82 a chapter in 
which James attempted to disprove Locke’s atomism of ideas and by 
which, according to von Weizsacker, 83 Bohr was particularly impressed, 
may be found at the end of this paper, where Bohr declared: “In particu¬ 
lar, the apparent contrast between the continuous onward flow of associ¬ 
ative thinking and the preservation of the unity of the personality exhibits 
a suggestive analogy with the relation between the wave description of the 
motions of material particles, governed by the superposition principle, 
and their indestructible individuality.” 84 

Before we conclude our discussion on the development of Bohr’s com¬ 
plementarity conception, it seems to be worthwhile to make a brief digres¬ 
sion on the term “complementary” itself. This may throw some light on 
the meaning it had when Bohr adopted it as a technical term. Its earliest 
use in science and also its earliest occurrence in the scholastic curriculum 
is undoubtedly the geometrical expression “complementary angles.” Al¬ 
though, as we know, 85 Bohr was deeply interested in the foundations of 



The Copenhagen Interpretation 


365 


geometry and in the relations of mathematics to language, it was probably 
not this geometrical expression but James’s application of the term which 
had an impact on Bohr’s mind. In his discussion of hysteric diseases 
James 86 described an experiment performed by Pierre Janet, 87 the noted 
psychologist and neurologist of the Salpetriere, who like Freud was a 
pupil of J. M. Charcot and specialized in the field of disordered personal¬ 
ity. Studying a patient called Lucie, Janet covered her lap with cards, each 
bearing a number; he then told her that on waking, after the present state 
of trance, she should not see any card whose number is a multiple of 3. 
“When she was awakened and asked about the papers on her lap, she 
counted and said she saw only those whose number was not a multiple of 
3. To the 12,18,9,etc., she was blind. But the hand , when the sub-con¬ 
scious self was interrogated by the usual method of engrossing the upper 
self in another conversation, wrote that the only cards in Lucie’s lap were 
those numbered 12,18,9,etc., and on being asked to pick up all the cards 
which were there, picked up these and let the others lie.’’ 88 James quoted 
this experiment as an illustration of the fact that “in certain persons the 
total possible consciousness may be split into parts which coexist but 
mutually ignore each other, and share the objects of knowledge between 
them. More remarkable still, they are complementary .” 89 Few things are 
more curious,’’ wrote James a few pages later, “than these relations of 
mutual exclusion....’’ 90 In James’s psychology the sum total of comple¬ 
mentary parts, as exemplified by Janet’s experiment, makes up the normal 
totality; in Bohr’s physics the sum total of complementary descriptions 
makes up the description of classical physics. 

On September 16, 1927, in the auditorium of the Istituto Carducci in 
Como, where the International Congress of Physics convened in comme¬ 
moration of the hundredth anniversary of Allessandro Volta’s death 
(Volta was born and died in Como), Bohr delivered a lecture on “The 
Quantum Postulate and the Recent Development of Atomic Theory’’ 91 in 
which he disclosed for the first time in public his ideas on complementar¬ 
ity. Addressing an audience of leading physicists from all parts of the 
world, 92 Bohr began his lecture with the words: “I shall try by making use 
only of simple considerations and without going into any detail of techni¬ 
cal mathematical character to describe to you a certain general point of 
view which I believe is suited to give an impression of the general trend of 
the development of the theory from its very beginning and which I hope 
will be helpful in order to harmonize the apparently conflicting views 
taken by different scientists.’’ 93 After stressing the contrast between the 
classical description, based on the assumption that the phenomenon dis¬ 
cussed may be observed without appreciable disturbance, and the descrip¬ 
tion of quantum phenomena, subject to the quantum postulate, according 
to which to any atomic process an essential discontinuity, or rather indivi- 
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duality, has to be attributed, Bohr declared: “On one hand, the definition 
of the state of a physical system, as ordinarily understood, claims the 
elimination of all external disturbances. But in that case, according to the 
quantum postulate, any observation will be impossible, and, above all, the 
concepts of space and time lose their immediate sense. On the other hand, 
if in order to make observation possible we permit certain interactions 
with suitable agencies of measurement, not belonging to the system, an 
unambiguous definition of the state of the system is naturally no longer 
possible, and there can be no question of causality in the ordinary sense of 
the word. The very nature of the quantum theory thus forces us to regard 
the space-time coordination and the claim of causality, the union of which 
characterizes the classical theories, as complementary but exclusive fea¬ 
tures of the description, symbolizing the idealization of observation and 
definition respectively.’’ 94 This statement, in which the term “comple¬ 
mentary’’ appeared for the first time and in which spatiotemporal descrip¬ 
tion is referred to as complementary to causal description, contained the 
essence of what later became known as the “Copenhagen interpretation’’ 
of quantum mechanics. It is therefore appropriate to analyze this state¬ 
ment in some detail. The “state of a system,’’ as understood, for example, 
in ordinary mechanics, is the set of all the (position) coordinates and the 
momenta of the constituent parts and implies the possibility of using these 
data for the prediction (or retrodiction) of the structural properties of the 
system. Prediction (or retrodiction), however, is possible only if the sys¬ 
tem is closed, that is, unaffected by external disturbances. For an open 
system, strictly speaking, no “state’’ can be defined. Now, the evolution of 
the structural properties of the system is governed by the law of causality; 
their progress in time is the causal behavior of the system. But according 
to the quantum postulate, in atomic physics, any observation of the sys¬ 
tem implies a disturbance. In other words, a system, if observed, is always 
an open system. A space-time description, however, presupposes observa¬ 
tion. Hence, concluded Bohr, the claim of causality excludes spatiotem¬ 
poral description and vice versa. 95 The usual causal space-time descrip¬ 
tion, that is, the simultaneous use of complementary descriptions, is made 
possible in classical physics, argued Bohr, merely because of the extremely 
small value of the quantum of action “as compared to the actions involved 
in ordinary sense perceptions.’’ 

To give a first example of the breakup of the classical causal space-time 
description into its complementary parts, Bohr contrasted the wave theo¬ 
ry of light, and adequate description of the propagation of light in space 
and time, with the theory of light quanta which accounts for the interac¬ 
tion with matter in terms of energy and momentum. The mutual exclusion 
of the notions of wave and particle precludes, according to Bohr, a causal 
space-time description of optical phenomena. “The two views of the na- 
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ture of light are rather to be considered as different attempts at an inter¬ 
pretation of experimental evidence in which the limitation of the classical 
concepts is expressed in complementary ways.” 

After showing that the same situation exists with respect to the descrip¬ 
tion of the constituents of matter (corpuscles vs. de Broglie waves), Bohr 
pointed out that “the fundamental contrast between the quantum of ac¬ 
tion and the classical concepts is immediately apparent from the simple 
formulas which form the common foundation of the theory of light quanta 
and of the wave theory of material particles.” Writing the Einstein-de 
Broglie relations in the form 

Er = pk = h (7.7) 

where E and p are the energy and momentum, respectively, as used in the 
particle picture, and r = 1/v and k = \/a the period of vibration and the 
wavelength, respectively, as used in the wave picture, Bohr showed their 
relation to the ordinary mode of description by referring to the relativistic 
equations 


=>1^ 

II 

ft, 

(7.8) 

vdp = dE 

(7.9) 


where v denotes the velocity of the particle and c the velocity of light. 
Hence, Bohr pointed out, the phase velocity, which is given by v/a , turns 
out to be c 2 /v , and the group velocity, which according to de Broglie is 
defined by dv/da , coincides with v. “The possibility of identifying the 
velocity of the particle with the group velocity indicates the field of appli¬ 
cation of space-time pictures in the quantum theory. Here the comple¬ 
mentary character of the description appears, since the use of wave groups 
is necessarily accompanied by a lack of sharpness in the definition of 
period and wavelength, and hence also in the definition of the correspond¬ 
ing energy and momentum as given by relation (7.7).” For in accordance 
with the well-known Fourier analysis of wave trains 

A t Av = A* A a = 1 (7.10) 

where A t and A* denote the extension of the wave packet in time and 
space. But then, by (7.7), 

A t AE=Ax A p = h (7.11) 

which clearly shows that any contraction of the wave packet, that is, a 
sharper definition in the space-time description, is accompanied by an 
increase in the indefiniteness of the energy-momentum components, that 
is, a less precise definition in the causal description. This general recipro¬ 
cal relation between the maximum sharpness of definition of, relativist!- 
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cally speaking, the space-time vector and that of the energy-momentum 
vector was for Bohr “a simple symbolical expression for the complemen¬ 
tary nature of the space-time description and the claims of causality.” 
Bohr then discussed, from the viewpoint of complementarity, Heisen¬ 
berg’s uncertainty relations and the meaning of measurement in quantum 
theory, and pointed out that an adequate tool for complementary modes 
of description is offered precisely by the new formalism of quantum me¬ 
chanics. For this formalism, he contended, is essentially a purely symbolic 
scheme which permits only predictions, in accordance with the corre¬ 
spondence principle, of results obtainable under conditions specified in 
terms of classical concepts. 

In the discussion, 96 in which Born, Kramers, Heisenberg, Fermi, and 
Pauli participated, no objections were voiced—but the real issue of the 
paper was not touched upon. Those who heard Bohr’s ideas for the first 
time found it probably too difficult to comprehend their full meaning. 
Thus, for example, Leon Rosenfeld, who became one of the most eloquent 
proponents of the complementarity interpretation 97 and whom Bunge, 98 
in his critique of this approach, singled out as the most representative 
member of this school of thought, admitted, 99 referring to Bohr’s lecture: 
“I did not see, I did not feel, any of the subtlety that was in it.” Nor did the 
Gottingen theorists appreciate its meaning at that time. Wigner, for exam¬ 
ple, as reported by Rosenfeld, 100 remarked that Bohr’s lecture “will not 
induce any one of us to change his opinion about quantum mechanics.” 
And according to Wigner, 101 von Neumann had this to say on Bohr’s 
lecture: “Well, there are many things which do not commute and you can 
easily find three operators which do not commute.” What von Neumann 
alluded to was, of course, Bohr’s contention that the complementarity of 
the wave and particle pictures and the consequent limitation in the appli¬ 
cation of their corresponding conceptions are adequately reflected in the 
noncommutativity of certain operators. Von Neumann’s criticism may 
perhaps remind the student of philosophy of a similar question which had 
been asked in connection with Spinoza’s theory of attributes and its sub¬ 
jective interpretation by Erdmann. For, like Bohr’s “descriptions of na¬ 
ture,” Spinoza’s attributes 102 are intellectual modes of perception and, 
though not satisfying reciprocal relations such as those that are satisfied 
by Bohr’s complementary descriptions, do exclude each other. The episte¬ 
mological problem: Why does the human mind recognize only two attri¬ 
butes (res cogitans and res extensa) of the infinite substance which accord¬ 
ing to Spinoza has an infinitude of attributes? offers an interesting 
analogue to von Neumann’s sally—although it must, of course, be kept in 
mind that the ontological status of Spinoza’s substance differs radically 
from that which Bohr ascribed to the object of physical description. 



The Copenhagen Interpretation 


369 


It is interesting to note—although it would be rash to draw from it any 
far-reaching conclusions—that the Physikalische Berichte published two 
separate reviews of the substance of Bohr’s lecture, a very short one, writ¬ 
ten by Elsasser, 103 and an exceptionally long one of almost two pages, 
written by Smekal, 104 whereas Science Abstracts did not mention it at all! 
Physicists, engaged in the application of the new formalism to still un¬ 
solved problems in atomic physics, were too busy to direct their attention 
to questions of intepretation, and philosophers were generally still lacking 
technical knowledge to participate in the debate. 

It will have been noted that in his Como lecture Bohr did not explicitly 
define complementarity. The first to give a precise definition of this notion 
was Pauli. In his article for the Handbuch derPhysik 105 Pauli called two 
classical concepts—and not two modes of description—complementary 
“if the applicability of the one (e.g., position coordinate) stands in the 
relation of exclusion to that of the other (e.g., momentum)” in the sense 
that any experimental setup for measuring the one interferes destructively 
with any experimental setup for measuring the other. 106 Pauli, as we see, 
in contrast to Bohr, ascribed complementarity to two notions which be¬ 
long to the same classical mode of description (e.g., the particle picture) 
and not to two mutually exclusive descriptions. It was this notion of com¬ 
plementarity which was in his view the characteristic feature of quantum 
mechanics which he therefore proposed to call the “theory of complemen¬ 
tarity” (“Komplementaritatstheorie”) in analogy to the “theory of rela¬ 
tivity.” C. F. von Weizsacker, who like Pauli fully espoused the idea of 
complementarity—he once called it “the key to the presently best possible 
undentanding of quantum theory” 107 —recognized the subtle divergence 
between Bohr and Pauli and suggested a distinction between what he 
called “parallel complementarity” and “circular complementarity.” One 
should speak of the former if, as in Pauli’s example—and by his defini¬ 
tion—two concepts (as “position coordinate” and “momentum”) are 
complementary in quantum theory without having been so in classical 
physics or if, as in Hund’s example of complementarity, 108 two concepts 
(as “position” and “wave number”) are complementary in quantum me¬ 
chanics having been so also in classical physics. On the other hand, Bohr’s 
conception of the complementarity between space-time description and 
causal description should be called “circular complementarity.” For, von 
Weizsacker contended, it is only via a detour to wave mechanics and the 
deterministic Schrodinger equation of the wave function that a causal 
description obtains; but if we wish to arrive at a space-time description, a 
measurement of classical observables has to be performed which leads to a 
statement in terms of space and time; such a reduction of the wave packet, 
however, destroys the deterministic behavior of the function, as we know. 
“The complementarity between space-time description and the claim of 



370 


Conceptual Development of Quantum Mechanics 


causality,” wrote von Weizsacker, “is therefore precisely the complemen¬ 
tarity between the description of nature in classical notions and in terms of 
the ^ function.” 109 Bohr, as we know from his correspondence with von 
Weizsacker 110 on this issue, rejected this interpretation and declared that 
his expression “claim of causality” referred to the application of the no¬ 
tions of energy and momentum and their conservation principles which 
determine the course of events. 

We have mentioned these divergent conceptions of complementarity as 
an illustration of the conceptual difficulties which the Bohr interpretation 
of quantum mechanics had to face. Part of these difficulties originated 
undoubtedly from a certain vagueness of expression which the Copenha¬ 
gen interpretation exhibited, at least during its early stages. It has also 
been claimed, however, that it was precisely due to this vagueness and its 
concomitant conceptual flexibility that the Copenhagen interpretation 
managed to survive serious crises. According to Feyerabend, one of the 
reasons “for the persistence of the creed of complementarity in the face of 
decisive objections is to be found in the vagueness of the main principles of 
this creed,” 111 this ambiguity being perhaps one of the reasons of its fruit¬ 
fulness, as Groenewold 112 had pointed out. In this case, it may be said, a 
characteristic tenet of the Kierkegaard-Hdffding philosophy, which, as 
we have tried to show, 113 had an impact upon Bohr, would have vindicat¬ 
ed itself. 

Bohr himself, it seems, regarded the first enunciation of his new con¬ 
ceptions more as a program for further elaboration than as a definite 
statement of an immutable dogma. He was fully aware that both the foun¬ 
dations and the implications of his ideas had still to be subjected to critical 
investigations. It was therefore for him “a most valuable incentive ... to 
reexamine the various aspects of the situation as regards the description of 
atomic phenomena...” and “a welcome stimulus to clarify still further the 
role played by the measuring instruments” 114 when, a few weeks after the 
Como Congress, his interpretation became the target of severe criticisms 
at the fifth Physical Conference of the Solvay Institute, which convened in 
Brussels, under the chairmanship of Lorentz, from October 24 to 29, 
1927. Its theme was “Electrons and Photons.” 115 But, de facto , it provided 
an excellent opportunity for a fruitful exchange of views on the founda¬ 
tions of quantum mechanics and on its interpretation among the main 
originators of the theory: Bohr, Bom, L. Brillouin, L. de Broglie, A. H. 
Compton, Debye, Dirac, Ehrenfest, Einstein, R. H. Fowler, Heisenberg, 
Kramers, Pauli, Planck, and others. 

After W. L. Bragg’s lecture on the reflection of x-rays 116 and A. H. 
Compton’s talk on the discrepancies between experiment and the theory 
of electromagnetic radiation, 117 Louis de Broglie delivered a lecture 118 
entitled “The new dynamics of quanta,” in which he outlined his own 
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early contributions and their elaborations by Schrodinger and Bom. Dis¬ 
satisfied, however, with the latter’s purely probabilistic interpretation, 
which he regarded as contradicting the explanatory aim of theoretical 
physics, he included in his lecture an exposition of his causal theory. But 
instead of discussing the theory of double solution as proposed only a few 
months earlier in his Journal de Physique paper 119 and instead of explain¬ 
ing how he conceived the motion of a particle determined by the gradient 
of the phase common to both solutions of the wave equation, he presented 
in “an incomplete and diluted” 120 form a simplified version of his original 
ideas. This “watered-down version” or “pilot-wave theory,” as he called it 
later, postulated an independent existence of the particle but did not re¬ 
gard it as a singularity of a wave field. It was immediately clear that 
nobody accepted his ideas and that the majority of the participants pre¬ 
ferred the purely probabilistic interpretation of Born, Heisenberg, and 
Bohr. In fact, with the exception of some remarks by Pauli, who justified 
his objection on the ground of Fermi’s treatment of the collision between a 
particle and a rotator, 121 de Broglie’s causal interpretation was not even 
further discussed at the meeting. Only Einstein once referred to it en 
passant . 122 As de Broglie later admitted, it was partly due to this unfavor¬ 
able reaction that he abandoned his own ideas and espoused the Copenha¬ 
gen interpretation from then on. Twenty-five years later, however, 
Bohm’s paper 123 and certain developments in the general theory of rela¬ 
tivity to which his assistant, J. -P. Vigier, drew his attention revived his 
interest in his original conceptions and reconverted him to his causal 
approach. 

De Broglie’s lecture was followed by a detailed exposition by Born and 
Heisenberg on matrix mechanics, the transformation theory, and its pro¬ 
babilistic interpretation. “Quantum mechanics,” they declared, “leads to 
accurate results concerning average values but gives no information as to 
the details of an individual event. The determinism which so far has been 
regarded as the basis of the exact sciences has to be given up. Every addi¬ 
tional advance in our understanding of the formulas has shown that a 
consistent interpretation of the quantum-mechanical formalism is possi¬ 
ble only on the assumption of a fundamental indeterminism ....” 124 Refer¬ 
ring to Heisenberg’s uncertainty principle, they stated that “the real 
meaning of Planck’s constant h is this: it constitutes a universal gauge of 
the indeterminism inherent in the laws of nature owing to the wave-parti¬ 
cle duality.” 125 And they concluded their lecture with the provocative 
statement: “We maintain that quantum mechanics is a complete theory; 
its basic physical and mathematical hypotheses are not further susceptible 
of modifications.” 126 

The last speaker was Schrodinger. He gave a detailed report on the 
development of wave mechanics with special consideration of problems 
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concerning polyelectronic systems. The climax of the Congress was the 
general discussion at the end of the meeting. After some brief introductory 
remarks Lorentz requested Bohr to speak on the epistemological ques¬ 
tions which quantum mechanics has to face. Bohr accepted the invitation 
and presented an expose which contained essentially the substance of his 
Como lecture. Einstein, it will have been noted, 127 had not attended this 
Volta celebration, and it was therefore here in Brussels that he heard for 
the first time a coherent and comprehensive exposition of the complemen¬ 
tarity interpretation. As we know, however, for example, from his corre¬ 
spondence with his intimate friend Paul Ehrenfest—Einstein disliked the 
Gottingen-Copenhagen approach to atomic physics with its emphasis on 
discontinuities and acausality ever since it had been formulated. His state¬ 
ment “I look upon quantum mechanics with admiration and suspi¬ 
cion,” 128 made in the summer of 1926, is perhaps the best characterization 
of his attitude on this issue. It was against his scientific instinct to accept 
statistical quantum mechanics as a complete description of physical rea¬ 
lity. In fact, as we know from his correspondence with Bohr and Heisen¬ 
berg, 129 he attempted, having obtained from Bohr a preprint of Heisen¬ 
berg’s paper on the uncertainty principle, to disprove it by showing that it 
is possible, for instance, to determine the path of a microphysical object 
with a precision greater than the principle allows. The present Solvay 
Congress, Einstein thought, was therefore an appropriate opportunity to 
exchange views on this question and perhaps to reach a final clarification. 
Thus, both at the sessions and at informal meetings during the evenings 
Einstein expressed his opposition to the rejection of the principle of deter¬ 
ministic description and challenged Bohr by proposing cleverly devised 
thought experiments 130 to outwit Heisenberg’s principle. Most of these 
imaginary experiments were designed to show that the interaction be¬ 
tween the microphysical object and the measuring instrument is not so 
inscrutable as Heisenberg and Bohr maintained. Einstein considered the 
interaction between a photon, for example, and a movable part of a grating 
and assumed that the momentum change of the latter, due to the interac¬ 
tion, could be accurately measured. He then claimed that on the basis of 
the conservation principle of momentum the change of momentum of the 
microphysical object (photon) could be calculated so that both, momen¬ 
tum and position, could be known with an accuracy which the principle 
prohibits. Bohr’s refutation of Einstein’s arguments usually was to dis¬ 
close a fallacy in Einstein’s reasoning due to the fact that somewhere in the 
chain of deductions the uncertainty principle had not been duly taken into 
account, and that had this been done, no increase in information would 
have resulted. How ardent these discussions must have been can perhaps 
be seen from the fact that three years later, at the sixth Solvay Congress 131 
in 1930, Einstein and Bohr resumed their debate. In his attempt to dis- 
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prove Heisenberg’s uncertainty relation A E /4ir, Einstein consid¬ 

ered a box filled with radiation and possessing a shutter in one of its walls. 
The shutter was assumed to be operated by a clockwork, enclosed in the 
box, so that a photon could be released during an accurately determinable 
time interval A t. Einstein now contended that by weighing the box before 
and after the emission of the timed radiative pulse the energy of the latter 
could be computed from the mass-energy relation E = me 2 with an arbi¬ 
trarily small error A E. After a sleepless night over the argument Bohr 
showed the next morning that, in accordance with Einstein’s own general 
theory of relativity, the effect of the gravitational field on the rate of the 
clock—which, since rigidly attached to the box, had to be assumed to be 
moving during the measurement—introduces precisely an uncertainty as 
required by the Heisenberg relation. 132 Bohr’s recourse to Einstein’s grav¬ 
itational theory in order to save the consistency of the complementarity 
interpretation which was challenged by an argument independent of a 
theory of non-Newtonian gravitation has later been erroneously criticized 
by Popper as amounting “to the strange assertion that quantum theory 
contradicts Newton’s gravitational theory, and further to the still stranger 
assertion that the validity of Einstein’s gravitational theory (or at least the 
characteristic formulas used, which are part of the theory of the gravita¬ 
tional field) can be derived from quantum theory.” 133 

Of great importance for the clarification of the meaning of the wave 
function was Einstein’s objection which he had raised at the beginning of 
the general discussion at the Solvay Congress in 1927. He considered a 
particle or photon that passes through a slit in a screen covered by a 
hemispherical photographic film. 134 The wave function associated with 
the particle or photon will be diffracted and will spread out over the film 
so that, in accordance with Born’s interpretation, practically every point 
on the film has a nonzero probability of being struck by the microphysical 
object. As soon, however, as the point of impact is recorded, the probabili¬ 
ty of finding the object at any other point on the film becomes zero. Ein¬ 
stein now pointed out that to assume with the proponents of the comple¬ 
mentarity interpretation that, as long as the object has not been observed, 
it is virtually present all over the region covered by the wave function, 
would lead to a contradiction with the theory of relativity or to conclu¬ 
sions which are strange even in the framework of classical conceptions. 
For the contraction of the wave function to a single point at the moment of 
the recording could be accounted for only by some kind of action at a 
distance. The alternative interpretation which ascribes to the microphysi¬ 
cal object a distinct trajectory would lead, as Einstein argued, to the con¬ 
clusion that the description in terms of a wave function is not complete. 

As a result of Einstein’s objection and similar considerations—as, for 
example, those which referred to the inevitable use of multidimensional 
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spaces for the representation of the wave function—it became clear that 
the wave function, as Heisenberg, Pauli, and Dirac had already empha¬ 
sized during the general discussion, does not in itself represent a course of 
events in space and time. It rather expresses our knowledge of events. It 
became clear that what the physicist does is essentially this: from an obser¬ 
vation he constructs a wave function which obeys in its progress the laws 
of quantum mechanics but constitutes at any time merely a catalog of 
probabilities for the results of subsequent measurements or observations. 
The state of affairs between one observation and the next cannot be de¬ 
scribed; but how the observation is described depends upon the experi¬ 
mental setup chosen by the experimenter. The results of observations thus 
can never completely be objectified. 

That a consistent interpretation of the accepted formalism of quantum 
mechanics makes these conclusions inevitable was the result of the general 
discussion at the Congress. Even Einstein, defeated but not convinced, 
had to admit that from the logical point of view the theory and its comple¬ 
mentarity interpretation form a consistent system of thought. 

For the next two and a half decades, the Copenhagen interpretation 
was the only accepted interpretation of quantum mechanics—and for the 
majority of physicists it is so even today. It may therefore be said that the 
search for a general consistent theory of the mechanics of atoms, a search 
which, as we have seen, was ushered in at the first Solvay Congress of 
1911, found its successful completion and finale in the fifth Solvay Con¬ 
gress of 1927. 
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CHAPTER 8 

Validation of the Theory 


8.1 Some Applications of the Theory 

For the time being the epistemological problems of quantum mechanics 
were generally regarded as of purely academic interest and received rela¬ 
tively little attention. Satisfied that the theory “works,” since it provided 
unambiguous answers whenever invoked, physicists engaged themselves 
rather in solving problems which so far had defied all previous attempts or 
which promised to open up new avenues of research. The year 1927 thus 
not only became the year in which the quantum-mechanical formalism, in 
all its essential points, received a formal completion and a consistent inter¬ 
pretation; 1927 also witnessed a veritable avalanche of elaborations and 
applications of the new conceptions and led to new insights in atomic 
physics to an unprecedented extent. The obvious agreement with previous 
results or improvement upon them, the mutual consistency of the results 
obtained, and the variety of phenomena satisfactorily dealt with dispelled 
all doubts about possible inconsistencies in the new formalism irrespective 
of its epistemological interpretation. 

It would lead us too far to give an exhaustive account of all these 
achievements. Suffice it to mention only a few typical ones—and primar¬ 
ily those which had a bearing on problems and effects that contributed to 
the conceptual development of the theory. 

How quantum mechanics accounted successfully for the Stark effect 
has already been recounted. 1 With respect to the Zeeman effect it should 
be noted that by employing the formal method of matrix mechanics, Hei¬ 
senberg and Jordan 2 succeeded in deriving Lande’sg formula and explain¬ 
ing the anomalous effect for atoms with one valence electron. 

A mathematical scheme to include the Uhlenbeck-Goudsmit spin hy¬ 
pothesis 3 in the wave-mechanical description of a system was proposed by 
Darwin 4 and by Pauli. 5 Pauli replaced Schrodinger’s wave function by a 
pair of wave functions whose squared absolute values determine the prob¬ 
ability density of finding the electron with its spin oriented parallel or 
antiparallel to an arbitrarily chosen axis of quantization. In analogy to the 
orbital angular-momentum operators he introduced spin operators, rep¬ 
resented by what were subsequently called the “Pauli matrices” which 
operate on the two-component wave function, a linear superposition of 
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the two wave functions. By studying the transformation of its components 
under a rotation of the coordinate system and extending the results to 
systems of more than one electron, Pauli developed a consistent quantum- 
mechanical spin theory which explained a large number of phenomena 
but not the origin of the electron’s spin. Pauli’s theory, like the spinless 
quantum mechanics on which it was based, failed to satisfy the require¬ 
ment of relativistic invariance. Dirac’s generalization 6 of Schrodinger’s 
equation to a set of four simultaneous partial differential equations of the 
first order, the so-called “linear Dirac equation,” met the requirement of 
relativity and accounted automatically for the spin properties without 
abandoning the general physical aspects of the wave-mechanical descrip¬ 
tion. Pauli’s formalism turned out to be a low-speed limiting case of Dir¬ 
ac’s relativistic theory of the electron. 

That for the harmonic oscillator wave mechanics agrees with ordinary 
mechanics had already been shown by Schrodinger, as we have seen. 7 A 
more general and direct line of connection between quantum mechanics 
and Newtonian mechanics was established in 1927 by Ehrenfest, 8 who 
showed “by a short elementary calculation without approximations” that 
the expectation value of the time derivative of the momentum is equal to 
the expectation value of the negative gradient of the potential-energy 
function. 9 Ehrenfest’s affirmation of Newton’s second law in the sense of 
averages taken over the wave packet had a great appeal to many physicists 
and did much to further the acceptance of the theory. For it made it 
possible to describe the particle by a localized wave packet which, though 
eventually spreading out in space, follows the trajectory of the classical 
motion. As emphasized in a different context elsewhere, 10 Ehrenfest’s 
theorem and its generalizations by Ruark 11 for conservative systems with 
an arbitrary number of particles do not conceptually reduce the quantum 
dynamics to Newtonian physics. They merely establish an analogy— 
though a remarkable one in view of the fact that, owing to the absence of a 
superposition principle in classical mechanics, quantum mechanics and 
classical dynamics are built on fundamentally different foundations. 

That Schrodinger’s theory could account also for the photoelectric 
effect, the main stronghold of the particle view of radiation, was shown in 
1926 by Wentzel 12 and by Beck, 13 who gave a wave-mechanical derivation 
of Einstein’s photoelectric equation. 14 Wentzel and Fues 15 explained the 
Auger effect, the nonradiative rearrangement of atomic electrons in the x- 
ray region of energy levels, and Gordon 16 and Wentzel 17 provided a wave- 
mechanical description of the Compton effect. How the problem of like 
particles and, in particular, that of the helium atom, the great puzzle of the 
older quantum theory, were successfully solved has already been men¬ 
tioned at the end of Sec. 7.1. Now it was understood why the older quan¬ 
tum theory with all its perturbation methods could not cope with the 
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problem. Kellner, 18 using Heisenberg’s approach, obtained the correct 
ionization potential (24.47 eV) for helium. By showing that the molecular 
field, postulated by Weiss 19 twenty years earlier, can be fully accounted 
for in terms of the quantum-mechanical exchange forces Heisenberg 20 
laid the foundations of a quantum theory of ferromagnetism. Further¬ 
more, the adiabatic theorem which through Ehrenfest’s work and 
Burgers’ proof concerning the adiabatic invariance of the periodicity 
moduli of the action integrals played such an important role in the concep¬ 
tual structure of the older quantum theory, as we saw in Sec. 3.1, could 
now be reformulated in the language of quantum mechanics: a system 
which is in a stationary state will stay in this state during an adiabatic 
process. An elegant proof of this adiabatic theorem in quantum mechanics 
was given in 1928 by Bom and Fock. 21 Similarly, by a wave-mechanical 
reformulation of the Bom-Heisenberg-Jordan matrix formalism of radi¬ 
ation processes, based on a semiclassical treatment of the electromagnetic 
field, Klein 22 was able to derive the results obtained by Kramers and by 
Kramers and Heisenberg in their pre-quantum-mechanical theory of dis¬ 
persion as described in Sec. 4.3 The insufficiencies of Klein’s semiclassical 
radiation theory and, in particular, the fact that the relation between the 
emitted radiation and the atomic moment could not rigorously be derived 
from the general principles of quantum mechanics were soon remedied by 
Dirac’s procedure 23 of quantizing the radiation field, a procedure which 
marked the beginning of quantum electrodynamics. Finally, it was shown 
in the fall of 1927 for systems obeying the Bose-Einstein statistics by Jor¬ 
dan and Klein and early in 1928 for systems obeying the Fermi-Dirac 
statistics by Jordan and Wigner that if the wave function is regarded as a 
field in ordinary space and time, but treated as a quantum-mechanical 
operator subject to certain quantum conditions, the usual quantum for¬ 
malism for particles and the formalism for waves are mathematically 
equivalent. 24 The Klein-Jordan-Wigner second quantization method was 
generally considered as a mathematical expression of the wave-particle 
duality, of the fact that “the particle picture and the wave picture are 
merely two different aspects of one and the same physical reality.” A 
better agreement between formalism and interpretation was hard to ima¬ 
gine. 

Since 1927, the development of quantum mechanics and its applica¬ 
tions to molecular physics, to the solid state of matter, to liquids and gases, 
to statistical mechanics, as well as to nuclear physics, demonstrated the 
overwhelming generality of its methods and results. In fact, never has a 
physical theory given a key to the explanation and calculation of such a 
heterogeneous group of phenomena and reached such a perfect agreement 
with experience as has quantum mechanics. 
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CHAPTER 9 

Two Fundamental Problems 


9.1 Completeness 

With the completion of its mathematical formalism, the establishment of 
its epistemological interpretation, and the accomplishment of its physical 
applications, quantum mechanics gained wide recognition as a full- 
fledged theory and gradually became a major subject in the program of the 
study of physics. The need for a coherent presentation of the theory was 
widely felt. The formulation most amenable to this end—and therefore 
also the first to be presented in the form of a textbook—was of course wave 
mechanics. In fact, the edition 1 of Schrodinger papers in book form, pub¬ 
lished at the end of 1926, may be regarded as the earliest text on this 
subject. It was soon followed by a number of treatises 2 which either con¬ 
fined themselves exclusively to wave mechanics, as, for example, Biggs’s 
work, the earliest textbook on this subject written in English, 3 or present¬ 
ed the topic in a more comprehensive context. 4 

The first complete exposition of the general formalism of quantum 
mechanics, presented in a logically consistent and axiomatic fashion, 
based on the notions of “observables” and “states” as primitives, was 
Dirac’s Principles of Quantum Mechanics . 5 The impact of the book on the 
generation of modem physicists is perhaps best characterized by a remark 
made by Lennard-Jones when the book appeared: “An eminent European 
physicist, who is fortunate enough to possess a bound set of reprints of Dr. 
Dirac’s original papers, has been heard to refer to them affectionately as 
his ‘bible.’ Those not so fortunate have now at any rate an opportunity of 
acquiring a copy of the authorized version.” 6 The key to Dirac’s success in 
unifying the presentation of the statistical transformation theory was his 
consistent use of a vector space of states that is, a space with as many 
dimensions as there are linearly independent states of the system, each 
axis corresponding to an independent state and each other state being 
completely determined by the direction of a vector relative to these axes. 
In the first half of the book Dirac expounded the general theory and in the 
second part applied it to a number of important problems such as the 
electronic structure of atoms, collision problems, and the radiation theo¬ 
ry. The second edition (1935), for which he rewrote almost the whole text 
and which had to undergo only minor modifications in subsequent edi- 
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tions, was for many years the standard text on the subject, hardly “sur¬ 
passed in brevity and elegance” by later publications in this field. 

Regarding mathematics as a handmaid of physics, as we have seen in 
connection with the introduction of the 5-function, Dirac “tried to keep 
the physics to the forefront.” 7 Not particularly interested in questions 
concerning mathematical rigor, he advanced his conception of the vector 
space without realizing at first that it was but a modification of the Hilbert 
space. Von Neumann’s Mathematische Grundlagen der Quantenme - 
chanik 8 with its emphasis on mathematical rigor was therefore a timely 
contribution, supplementary to Dirac’s work. “While Dirac presents his 
reasoning with admirable simplicity and allows himself to be guided at 
every step by physical intuition—refusing at several places to be burdened 
by the impediment of mathematical rigor—von Neumann goes at his 
problem equipped with the nicest of modem mathematical tools and ana¬ 
lyzes it to the satisfaction of those whose demands for logical complete¬ 
ness are most exacting.” 9 

The first half ot von Neumann’s monumental book is essentially an 
elaboration of his papers published in 1927, whereas the second half, in¬ 
spired by the ideas of Bohr and Heisenberg, concerns itself with the prob¬ 
lems of causal description and measurement. Dealing with ensembles of 
systems, von Neumann distinguished between “homogeneous ensembles” 
(“einheitliche Gesamtheiten”) or, as Weyl used to call them, “pure 
cases” (“reine Falle”) 10 and “mixtures” (“Gemische”). The former were 
defined as represented by a single normalized state function expressible 
in terms of a complete orthonormal system t/t = 2 a n if / n ; in it every phys¬ 
ical quantity has the same expectation value, whether measured in the 
whole ensemble or in any of its subensembles; it thus represents maximal 
knowledge. Mixtures were defined, as corresponding to physical condi¬ 
tions for whose specification a set of ^’s was required: ^ (1) ,^ (2) ,...,^ (/ °,..., 
each ^ (/c) being present with a probability p ik) ; a mixture 2 p ik) tfj ik) thus 
describes a system in which knowledge is not maximal. 11 Following a 
suggestion by Landau, 12 von Neumann 13 introduced, for the unique char¬ 
acterization of the statistics of an ensemble, the “statistical operator” U 
which eventually, under the name of “/? matrix,” became a major tool in 
quantum statistics. Von Neumann showed that quite generally 
(A) = Tr( UA ) and that U 2 = U is a criterion for the existence of a pure 
case. 

In the fourth chapter of the book von Neumann applied these notions 
to his famous analysis of the problem whether the statistical theory of 
quantum mechanics as established constitutes a logically closed theory or 
whether it could be reformulated as an entirely deterministic theory by the 
introduction of hidden parameters, that is, additional variables which, 
unlike ordinary observables, are inaccessible to measurements and hence 
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not subject to the restrictions of uncertainty relations. He came to the 
conclusion that “the present system of quantum mechanics would have to 
be objectively false, in order that another description of the elementary 
processes than the statistical one may be possible.” 14 To prove this 
theorem he first showed that dispersion-free ensembles do not exist. For in 
such ensembles, he argued, (A 2 ) = (A ) 2 for all operators A or equivalent¬ 
ly Tr (UA 2 ) = (Tr (UA)) 2 \ substituting for A the operator he ob¬ 
tained 15 U=0otU= 1; but U= 0, leading to (A ) = 0 and furnishing no 
information at all, had to be ruled out and U — 1, implying in a space of 
infinitely many dimensions (l ) = oo , is not normalizable and could cor¬ 
respond only to a state which is not dispersion-free. Having thus demon¬ 
strated that ensembles are never dispersion-free, von Neumann showed 
that homogeneous ensembles (pure cases) do exist by proving that any 
ensemble whose statistical operator is a projection operator is homoge¬ 
neous. Finally, he pointed out, the statistical nature of homogeneous 
states cannot be removed by conceiving them as mixtures of substrates, 
each of them associated with a definite set of values of hidden parameters, 
and by assuming that the dispersion of the homogeneous state results from 
an averaging over such “actual” states; because then, he declared, a homo¬ 
geneous ensemble could be represented as a mixture of two different en¬ 
sembles, contrary to its definition, and because dispersion-free ensembles 
“which would have to correspond to the ‘actual’ states do not exist,” as 
previously demonstrated. 

Von Neumann thus succeeded in bringing the foremost methodolog¬ 
ical problem of quantum mechanics down from the realm of speculation 
into the reach of empirical decision. For the essence of his demonstration 
was this: as long as observation and experiment enforce upon us the pres¬ 
ent formalism of quantum mechanics, it is logically impossible to com¬ 
plete this formalism to a deterministic description of physical processes. 16 

It is interesting to note that at the same time and independently of von 
Neumann’s work, Solomon, 17 utilizing the technique of Stieltjes’s treat¬ 
ment of the “problem of moments,” proved that the introduction of hid¬ 
den parameters leads to inconsistencies with the accepted formalism. In 
contrast to von Neumann’s demonstration, which, as may have been no¬ 
ticed, was independent of the existence of noncommuting operators, Solo¬ 
mon’s proof presumed the uncertainty principle. 

In view of the decisive importance of von Neumann’s hidden-param¬ 
eter theorem for the foundations of quantum mechanics various attempts 
were subsequently made to prove the theorem by different methods. We 
mention only Destouches-Fevrier’s proof, 18 which was based on the exis¬ 
tence of noncommuting operators, and a recent demonstration by Jauch 
and Piron 19 which is independent of the assumption, tacitly made by von 
Neumann but untenable in systems with superselection rules, 20 namely, 
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that every projection operator is an observable, and also independent of 
the assumption that the expectation values of noncompatible observables 
are additive, a property of states which was rigorously established 21 only 
in 1957. In addition, these authors showed that “the hidden variable inter¬ 
pretation is only possible if the theory is observably wrong”; that is, they 
showed that any introduction of hidden variables to “complete” the theo¬ 
ry modifies it to such an extent that it leads to empirically refutable re¬ 
sults, a point not raised in von Neumann’s analysis. 


9.2 Observation and Measurement 

Another aspect of quantum mechanics which von Neumann regarded as 
so far insufficiently dealt with was the problem of observation and mea¬ 
surement, that is, the relation between elements of the physical theory and 
human experience, a subject to which he devoted the last two chapters of 
his book. 22 For he realized that the accessibility of quantum-mechanical 
results to the observer raises a serious problem for the following reason. In 
classical physics the interaction between object and observer, if not re¬ 
garded as wholly negligible, was in principle eliminable since the measure¬ 
ment process was fully analyzable in terms of the equations of motion 
alone. In fact, measurement, in classical physics, constituted a chapter of 
applied physics: physical theory and the analysis of measurement em¬ 
ployed the same category of concepts. In quantum mechanics, however, a 
fundamental disparity exists between the language of the theory and the 
language of ordinary experience, which latter, as Bohr repeatedly 
stressed, is the indispendable medium for the ultimate formulation of 
results of measurement. 

Inspired by Bohr’s paper 23 of 1929 on the quantum of action and the 
description of nature, von Neumann espoused the idea that every quan¬ 
tum-mechanical measuring process involves an unanalyzable element. He 
postulated that, in addition to the continuous casual propagation of the 
wave function governed by the Schrodinger equation, the function under¬ 
goes, in the case of a measurement, due to an intervention of the observer 
on the object, a discontinuous, noncausal, and instantaneous change. 
Thus, in a mixture, characterized by the statistical operator U , a measure¬ 
ment of the observable R, supposed to have a complete orthonormal set of 
eigenfunctions <p u <p 2 ,— corresponding to the (discrete, nondegenerate) 
eigenvalues A l9 /l 2 ,..., changes U to 


u' = Z(<p n ,UyjP M 
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an irreversible process, whereas the causal, continuous, and reversible 
change is given by the unitary transformation 

( liritH \ TT (liritH \ 

U. - exp [ -—] ] 

where H is the energy operator (Hamiltonian) of the system. In the par¬ 
ticular case of a pure state expanded in the eigenfunctions of 

the observable R to be measured, the measurement of R “reduces” t/t to 
\fj k , the eigenstate whose eigenvalue A k is the result of the measurement. 
According to von Neumann this “reduction of state” or, more generally, 
“reduction of wave packet” is an unavoidable element in every measure¬ 
ment process and assures that the same result will be obtained for an 
immediately following measurement of the same observable (“projection 
postulate”). 

Von Neumann explained the insufficiency of continuous transforma¬ 
tions U^> U t by pointing out that in a measurement not the system S itself 
but rather S combined with a measuring apparatus M (and ultimately 
with the observer) is the object of observation. “The theory of the mea¬ 
surement is a statement concerning S + M, and should describe how the 
state of S is related to certain properties of the state of M (namely, the 
positions of a certain pointer, since the observer reads these). Moreover, it 
is rather arbitrary whether or not one includes the observer in M, and 
replaces the relation between the state S and the pointer positions in M by 
the relations of this state and the chemical changes in the observer’s eye or 
even in his brain.” 24 More specifically, the situation as conceived by von 
Neumann may be described as follows. A physical system S is supposed to 
be in the state ^ = 2 a n ip n (x), where (x) is the eigenfunction belong¬ 
ing to the eigenvalue A n of the observable R to be measured. To measure 
R , S has to be coupled with a measuring apparatus M whose pointer 
readings g 0 , (supposed to form a discrete set) correspond to the 

eigenfunctions <p 0 ( y), q> x (y),... of an observable G, <p 0 ( y) representing 
the zero (initial) state of M with g 0 . Before the coupling the state function 
of the system S + M is 

=q>o(y) X ) 

and after the coupling 


^ 2 (x,y) = £ a n if> n ( x)<p n ( y) 

that is, a superposition of different states tf> n q> n with probabilities \a„ | 2 . 
The indefiniteness of G (and, correspondingly, of the pointer readings g n ) 
is resolved by the reduction 'V 2 ^ x l>k ( x)<p k ( y) which occurs at the mo- 
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ment when the observer becomes aware tnat the pointer reading is g k . If, 
instead, a second apparatus M ' had been applied to measure G , then in 
exact analogy the final state of S + M + M ' before the reading would have 
been = 2 a n xf> n q) n q )' and so on. Von Neumann showed that the inser¬ 
tion of such additional instrumental (or “objective”) stages does not af¬ 
fect the outcome of the measurement, namely, the value A k of R, a conclu¬ 
sion by which he justified the adherence to the principle of the 
psychophysical parallelism according to which, as Bohr contended, it 
“must be possible so to describe the extraphysical process of the subjective 
perception as if it were in reality in the physical world.” 25 

Von Neumann’s truncation of an otherwise infinite regress 
S + M + M ' + • • • by resorting to the subjective perception of the observ¬ 
er or, as London and Bauer 26 formulated it, to the observer’s “faculty of 
introspection,” a process which finds its physical expression in the “re¬ 
duction of the wave function,” introduced the factor of human conscious¬ 
ness as an integral part into the formulation of quantum mechanics. As 
von Neumann admitted himself, he was greatly influenced in this respect 
by Leo Szilard, who, at that time Privatdocent in Berlin, studied the prob¬ 
lem to what extent the intervention of an intelligent being on a thermody¬ 
namic system may produce a perpetuum mobile of the second kind. Szi¬ 
lard showed that owing to the very measurement performed by such a 
being—in fact, it suffices to ascribe to the measuring agency merely a 
memory—the system’s behavior changes distinctly and may lead to a 
decrease of the entropy, thus violating the second law of thermodynamics, 
unless the measuring process itself implies an entropy increase. 27 Szilard, 
in turn, was led to these thought-provoking ideas—somewhat reminiscent 
of Maxwell’s “demon”—through his study of Smoluchowski’s paper 28 on 
the limitations of the second law which Smoluchowski had read in Gottin¬ 
gen, on the invitation of the Wolfskehl committee, 29 shortly before his 
appointment to the directorship of the Institute of Physics at the Universi¬ 
ty of Cracow in 1913. Smoluchowski’s conception of an intellect that is 
constantly cognizant of the instantaneous state of a dynamical system and 
thus able to invalidate the second law of thermodynamics without per¬ 
forming work was probably the earliest logically unassailable speculation 
about a physical intervention of mind on matter. It paved the way, as we 
have seen, toward von Neumann’s far-reaching conclusion that it is not 
possible to formulate the laws of quantum mechanics in a complete and 
consistent way without a reference to human consciousness. Von Neu¬ 
mann fully realized that this conceptual procedure led to the inexorable 
result that “experience only makes statements of this type: an observer has 
made a certain (subjective) observation; and never any like this: a phys¬ 
ical quantity has a certain value,” 30 or, as Heisenberg 31 once said, “the 
laws of nature which we formulate mathematically in quantum theory 
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deal no longer with the particles themselves but with our knowledge of the 
elementary particles.” 

Although unobjectionable from the purely logical point of view, von 
Neumann’s theory of measurement soon became the target of severe criti¬ 
cisms. Thus, to mention a recent example, his theory was censured as 
being “founded on a radically subjectivistic (solipsistic) philosopy.” 32 It 
was also pointed out 33 that the transformation of a pure state into a mix¬ 
ture, as envisaged by von Neumann, could not easily be reconciled with 
certain conservation principles. Margenau and his school, 34 insisting on 
the importance of a sharp distinction between “state-preparation” and 
“measurement,” rejected the projection postulate on the ground that a 
measurement often destroys the state corresponding to the eigenvalue 
obtained as, for instance, in the measurement of the energy of an atom by 
observing its emission spectrum where it is precisely the decay of the state 
which renders possible the measurement. Viewing the measuring process 
as an interaction between a micro object and a macro system, in the course 
of which the latter evolves from a thermodynamically metastable state to 
a stable state, depending on the state of the micro object, Ludwig and his 
school 35 tried to find the answer to the problem in the ergodic behavior of 
macro observables. Numerous other attempts have been made to modify 
von Neumann’s theory of measurement or to replace it by what seemed to 
these authors 36 a more acceptable solution of the problem. 

While most of these attempts suggested modifications of only certain 
parts of von Neumann’s formulation of quantum mechanics or even of 
only the interpretation as given by him, the basic algebraic structure of the 
formalism as a whole also soon became the object of important generaliza¬ 
tions. In the search for a completely rigorous mathematical foundation of 
the theory and a solution of the difficulties mentioned at the end of Chap. 
6, two main lines of attack have been adopted. The first of these, aiming at 
an explicit algebraic generalization of the operator calculus and the struc¬ 
ture of the Hilbert space, began when Jordan 37 introduced into quantum 
mechanics the so-called quasi-multiplicative 38 commutative algebra, that 
is, an algebra which differs from the noncommutative, but associative, 
matrix algebra in so far as the associative product AB (of two matrices) is 
replaced by the commutative expression \{AB + BA). Jordan justified 
this proposal by pointing out that “the whole statistics of measurements 
on quantum-mechanical systems involves relations of only quasi multipli¬ 
cation and not of complete multiplication between measurable quanti¬ 
ties.” 39 Using the abbreviation 

{A,B,C} = (AB)C-A(BC) 

Jordan showed 40 that any two quantum-mechanical observables A , B sat¬ 
isfy {A,B, A 2 } = 0 and that the last equation holds whenever 
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[A,B, A 2 ] = 0, where the square brackets are defined, in terms of ordinary 
multiplication, by [A,B,C] = (AB)C — A(BC). Jordan therefore sug¬ 
gested that hypercomplex algebras for which [A,B, A 2 ] = 0, which he 
called “r-number algebras” (“r-Zahl-Algebren”), are the appropriate 
generalizations of the conventional algebra as used in von Neumann’s 
formalism. A few months later, however, Jordan, von Neumann, and 
Wigner, 41 having made a penetrating analysis of these algebras, estab¬ 
lished, with the help of a theorem of Albert, 42 the equivalence between 
almost all (real 43 ) r-number algebras and algebras whose elements are 
ordinary (real) matrices, with products defined by quasi multiplication, 
the only exception being the algebra of all three-rowed Hermitian matri¬ 
ces with elements in the real nonassociative algebra of Cayley numbers. 44 

In 1959 Finkelstein, Jauch, and Speiser 45 showed that only three types 
of Hilbert spaces are possible in quantum mechanics: the real Hilbert 
space, the complex Hilbert space, and the quaternion Hilbert space. The 
first type, that is, a Hilbert space whose scalar multipliers are real 
numbers, became the subject of extensive investigations by Stueckelberg 
and his collaborators, 46 while Finkelstein, Jauch, Schiminovitch, and 
Speiser, 47 as well as Dyson 48 and Emch, 49 studied the structure of a qua¬ 
ternion quantum mechanics, 50 that is, an operator algebra in a Hilbert 
space whose scalar multipliers are quaternions. Reference to this possibil¬ 
ity has been made, in a different context, at the beginning of Chap. 5. 51 
Recently, Goldstine and Horwitz 52 investigated the properties of a Hil¬ 
bert space with Cayley numbers as scalar multipliers. 

The second line of development began with the by now classical work 
of Birkhoff and von Neumann, 53 who showed that the calculus of quan¬ 
tum-mechanical propositions, concerning results of measurements, is for¬ 
mally indistinguishable from the calculus of linear subspaces of the infi¬ 
nite-dimensional Hilbert space—pure states corresponding to 
one-dimensional subspaces (rays)—with respect to set products, linear 
sums, and orthogonal complements, and that the relations between mea¬ 
surements of different observables are reflected by the (ortho-comple¬ 
mented) lattice structure of such subspaces, while the distributive law 
afl (6Uc) = (adb) U (aflc), characteristic for the propositional calcu¬ 
lus of classical systems, no longer holds. Thus, the lattice structure of 
subspaces—reflecting the fact that measurements of different observables 
are apt to interfere with each other—was found to provide a rigorous 
logicomathematical formulation of the uncertainty relations and comple¬ 
mentarity properties. In 1952 Wick, Wightman, and Wigner 54 showed in 
their important work on superselection rules that there are physical sys¬ 
tems for which the sets of operators representing observables do not form 
an irreducible system in the space of the state vectors, as previously as¬ 
sumed, that not every self-adjoint operator is an observable, and that 



Two Fundamental Problems 


397 


certain subspaces in a Hilbert space of state vectors cannot be connected 
by observables, the state being subject to selection rules. In 1963 Jauch 
and Piron 55 made it clear that this situation can be incorporated without 
major difficulties into the lattice structure of the quantum-mechanical 
propositional calculus. A recent study by Horwitz and Biedenham 56 on 
the structure of a quantum theory which works with a Hilbert space over 
an arbitrary finite algebra with unity quantity promises to establish im¬ 
portant connections between these two main lines of development. 

The preceding brief description of recent elaborations of von Neu¬ 
mann’s formalism is intended to show that today von Neumann’s work 
still forms the main foundation for theoretical research in quest of a math¬ 
ematically rigorous formulation of quantum mechanics. 
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Concluding Remarks 

Any further discussion of the problems of completeness (hidden vari¬ 
ables) and measurement (observation), which, as the preceding refer¬ 
ences have shown, lead us to current research on the foundations of quan¬ 
tum mechanics, would be beyond the scope of the present study. 

There is one question, however, which deserves to be discussed in con¬ 
clusion of our exposition, namely, whether the conceptual scheme which 
quantum mechanics has established and which forms the foundation of 
current research in atomic physics constitutes, if viewed from a broad 
historical vantage, a complete break with the past in the sense of being an 
isolated chapter in the history of human thought or whether it is merely 
another stage in an uninterrupted course of scientific thought and another 
link in a chain of intellectual tendencies. 

It was one of the major objectives of our study to investigate to what 
extent precisely the conceptual foundations of quantum mechanics 
evolved from earlier conceptions and what factors played a decisive role in 
this process of formation. As we have seen, science never worked in a 
vacuum and every innovation, however radical it was, had its roots in the 
past. Hence, from the psychological, heuristic, and methodological point 
of view the answer is clear. 

But our present question goes deeper. It asks whether the resulting 
conceptual scheme, apart from its formative aspects in the mind of its 
originators, falls in line with the historical development of physics 
through the ages. In one respect, at least, this seems to be the case. 

Reviewing the historical development of physical thought, we may say 
that Aristotelian physics—and most of its medieval modifications—was a 
physics of an essentially unlimited number of qualities or properties which 
were assumed to inhere in the object under investigation. Atomism, as 
already conceived by the Greeks, attempted to reduce the immense diver¬ 
sity of physical qualities to a limited number of fundamental properties 
possessed by the atom. Guided by the logical principle that an observed 
property of a physical object cannot be explained by merely ascribing the 
property to the constituent parts of the object, atomists were soon led to 
distinguish two categories of observable properties: those which were con¬ 
ceived as belonging to the substances, independent of the observer—and 
these were the geometrical and dynamical properties—and those which, 
like color, heat, sound, or taste, were conceived as not being “in the ob¬ 
ject,” but merely “phantasms” in the “sentients.” 1 

Classical physics, with mechanics as its foundation, was but a quantita¬ 
tive attempt to implement this program of “reduction.” When Galileo 
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differentiated between “primary” and “secondary” qualities and de¬ 
scribed physical objects “as bounded and as possessing this or that 
shape,... as in this or that place during this or that time, as in motion or 
at rest, as in contact or not with some other body, as being one, many, or 
few...” 2 and regarded these properties alone as real, he codified the 
program of classical physics. 

It is commonly claimed that classical physics, in contrast to modem 
physics, provided a visualized model of physical reality. Strictly speaking, 
this is not true. For as soon as the atom was regarded as endowed with 
only shape, position, and motion and divested, for example, of color, the 
methodological gain in intellectual unification was paid for by the loss of 
visualizability or picturability. In fact, how could a colorless object be 
“pictured”? 3 If we speak of the “picturability” of Newtonian physics, we 
use this term in a more “pallid” and abstract sense: we mean that those 
properties which presumably “do really exist in the bodies themselves” 4 
can be presented by geometrical-kinematical models whose colors (and 
other secondary qualities) are completely irrevelant to the purpose for 
which they were designed, as, for example, to serve as didactic aids in the 
instruction of mathematics. 

In classical physics the primary qualities of shape, position, and motion 
(later also mass and charge) were regarded as objective features of phys¬ 
ical reality or as inseparable attributes of matter, independent of, and 
irreducible to, observation. Apart from Berkeley’s well-known philosoph¬ 
ical rejection of any distinction between primary and secondary proper¬ 
ties, 5 the first breach in this classical conception, from a logical point of 
view, was probably Herbart’s analysis in which he attempted to show that 
the notion of one thing with many properties is self-contradictory. Her- 
bart 6 resolved this paradox by reducing properties , including primary 
qualities, to relations of one thing to others. In a similar vein, though more 
from the standpoint of the epistemology of physics, Stallo pointed out that 
objects are known only through their relations to other objects. He de¬ 
clared: “They have, and can have, no properties, and their concepts can 
include no attributes, save these relations, or rather, our mental represen¬ 
tation of them. Indeed, an object can not be known or conceived otherwise 
than as a complex of such relations. In mathematical phrase: things and 
their properties are known only as functions of other things....” 7 The 
idea that properties of objects are effects (“Wirkungen”) exerted by them 
. either on our senses or on other natural objects was stressed also by Helm¬ 
holtz when he said that “... each quality or property of a thing is, in 
reality, nothing else but its capability of exercising certain effects upon 
other things ... it can never depend upon the nature of one agent alone, 
but exists only in relation to, and dependent on, the nature of some second 
object, which is acted upon.” 8 Also Hdffding, whose philosophy, as we 
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have seen, influenced Bohr so mucii, declared: “The ‘qualities* of a thing 
are indeed nothing more than the different forms and ways in which this 
thing influences that thing or is influenced by it. They are a thing’s capa¬ 
bilities of doing and suffering.” 9 

These statements, if interpreted as including also what was regarded as 
primary qualities, found their full corroboration in modem physics. At 
first, the theory of relativity revealed that the geometric-kinematic prop¬ 
erties of position, time, and velocity (as well as length, size, duration, and 
mass), previously regarded as objective features, depend upon the frame 
of reference. Subsequently, quantum mechanics showed in addition that 
these properties are relative also to the means of observation. 10 Even the 
notion of “individuality” or “identity” of elementary particles which, be¬ 
cause of its ontological implications, enjoyed a status above and beyond 
that of “properties” or “qualities,” loses in quantum mechanics, and par¬ 
ticularly in the context of exchange phenomena, as we have seen, 11 its 
universal applicability which it possessed in classical physics, where par¬ 
ticles could unambiguously be associated with sharply defined trajectories 
continuous in space and time. 

Contrary to the Aristotelian physics of qualities and in contrast to the 
Newtonian physics of primary properties, the language of quantum me¬ 
chanics is a language of interactions and not of attributes: processes , and 
not properties , are the elements of its syntax. Every attempt, for example, 
to describe in terms of attributes (or their mixtures) the spins in an unpo¬ 
larized beam of electrons is doomed to failure. 

But the human mind does not readily resign to such a renunciation, to 
such a replacement of properties by interactions (or “particles” by “occu¬ 
pation numbers”). Even Pauli, when defining the spin as “a peculiar, 
classically not describable two-valuedness,” 12 added “in the quantum- 
theoretic properties of the optical electron.” And although the revocation 
of primary qualities invalidates the classical distinction between kinema¬ 
tics and dynamics, with statics as a special case of the latter, 13 and thus 
certainly denies any logical priority of the former, even Heisenberg re¬ 
garded the uncertainty relations as a means to reinstate that distinction 
and “to obtain thereby an intuitive understanding of the quantum-me¬ 
chanical relations.” 14 Much of the present dispute on the foundations of 
quantum mechanics seems to derive from the reluctance to renounce the 
attribution of primary qualities to elementary particles. 

That the classical belief in the reality of physical properties, even if only 
in the reality of primary qualities such as position or momentum, is incon¬ 
sistent with the conceptual scheme of quantum mechanics was strikingly 
illustrated by the widely discussed Einstein-Podolsky-Rosen paradox. 15 
Defining a physical theory as complete only if “every element of the phys¬ 
ical reality” has “a counterpart in the physical theory,” Einstein, Po- 
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dolsky, and Rosen based their argument on the following criterion of 
physical reality: “If, without in any way disturbing a system, we can pre¬ 
dict with certainty the value of a physical quantity, then there exists an 
element of physical reality corresponding to this physical quantity.” 16 
Thus, in their view, the possibility of an exact evaluation of a dynamical 
quantity establishes this quantity as an element of reality, entitled to have 
a numerical counterpart in the theory, if the theory is complete. 

With these epistemological assumptions in mind, the authors consid¬ 
ered a combination of two partial systems I and II, which had interacted 
with each other only during a certain interval of time 0</< T 7 , and showed 
that the formalism of quantum mechanics makes it possible, without in 
any way disturbing II, to deduce the precise value of a variable a n of II 
from the measurement, after T, of a variable a Y in I. From their epistomo- 
logical criterion of reality they thus concluded that there exists an element 
of physical reality—let us call it e n —which corresponds to the exactly 
calculated value of a n . In fact e u must have existed prior to the measure¬ 
ment of a l because, in view of the absence of any interaction after T, the 
measurement itself could not have affected II in any way. 

If instead of a Y another variable a{ had been measured in I, the exact 
value of another variable a[ Y in II could have been predicted, again with¬ 
out in any way disturbing II, and another element of reality e' u would have 
been established (which likewise must have existed prior to the measure¬ 
ment of a [). 

The authors then showed that a Y and a{ can be chosen in such a way 
that a u and a' u belong to noncommuting operators A n and A h, respec¬ 
tively. 17 Since a wave function can precisely specify at most only one 
eigenvalue of two noncommuting operators at a time, either e n or e ' u — 
but never both of these two elements of reality—can have a counterpart in 
the theory. Quantum mechanics, they therefore concluded, does not pro¬ 
vide a complete description of physical reality. 

The challenge was soon answered, at least from the viewpoint of the 
complementarity interpretion of the theory, by Bohr’s insistence 18 on the 
essential influence of the procedure of measurement on the conditions 
underlying the very definition of physical quantities. Considering these 
conditions as an inherent element of any phenomenon to which physical 
reality can be attributed, Bohn pointed out that a mechanical system, even 
though having ceased to interact dynamically with other systems, does 
not constitute an independent seat of “real” attributes. Bohr’s rejection of 
the possibility of associating quantities with physical systems in a posses¬ 
sive manner, which rejection invalidated the epistemological premise of 
the paradox, was clearly but an expression of the fact that, within the 
conceptual framework of the Bom-Bohr-Heisenberg interpretation. 
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quantum mechanics is ultimately a physics of processes and not of proper¬ 
ties, a physics of interactions and not of attributes, even not of primary 
qualities of matter. 

From this point of view quantum mechanics may rightfully be regard¬ 
ed as falling in line with the general development of theoretical physics. 
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Derivation of the Equation (1.6)* 

Planck considered a linear oscillator whose dipole moment f(t) vibrates 
in the z direction and whose total energy U 0 = \Kf 2 + \Lf 2 . For 
U 0 = const, 


Kf+ Lf= 0 


(A.l) 


its proper frequency is 


v = 



277 


(A.2) 


and 


(f y = 

V J max f 


2 2C/ 0 


K 


(A.3) 


According to Hertz, 1 such an oscillator emits per period of vibration the 
energy 

3277Vt/o 


3c 3 K 

For small damping, (A. 1) has to be replaced by 


(A.4) 


Kf+ Lf+ — jKLf= 0 


(A.5) 


77 


where a is the logarithmic decrement (for amplitudes). Hence, by (A.4), 
the logarithmic decrement for the energy is 2a = 3277 4 v 2 /3c 3 fiT and 


a = 


1677 ^ 

3 c 3 K 


(A.6) 


In an external field of intensity Z, (A.5) becomes 


Kf+ Lf+ — jKLf= Z 

77 


or, by (A.2) and (A.6), 


3 c 3 a 


/+ layf + 47 7 2 v 2 / = —— Z. 

477 k 


(A.7) 


•See p. 9. 
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If Z is represented by a Fourier series, 




then for large T 


ft*\ 3c 3 v 1 • (2irnt 

AO =ttttX -s»nr« cos'- 


167T 3 V 2 " « 


where 
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Since cr is small, sin y H ~0 unless n~vT. Hence 
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(A.9) 
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where 
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27t(v — w/T) 
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The average energy U= U v of the oscillator, given by U = Kf 2 
= (lbTrWScV) is therefore 


U = 


3 c 2 


X ^ » sin 2 n • 


3277 2 <tv 3 


(A.10) 


On the other hand, the intensity J of the exciting radiation, obtained by 
averaging Z 2 over the time interval from t = 0 to t = T, is 


and can be written as 



(A. 11) 


(A.12) 


where J^ is the energy absorbed by a linear oscillator of proper frequency 
// and negligible damping. Hence 

J^=k^U^ (A.13) 

where k^ is a function of //. Comparison of (A.ll) with (A.12) shows 
that 
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or after interchange of integration and summation, 

3c 3 
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Since only those // need to be considered for which // 
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Since the last integral is av/ 2, /r v = 327t 2 v 2 /3c 3 and, by (A. 13), 

327t 2 v 2 

J v —izEzL u 

3 c 3 

Finally, as the density of the field energy is given by 


(A.15) 


u = — (E 2 + H 2 ) = —6Z t = —/ 

87T Sir 4ir 

where E and H are the electrostatic and magnetic-field vectors, it follows 
that 


u 


V 



or 


u v 



which is Eq. (1.6) on page 9. For details of this derivation the reader is 
referred to Planck’s writings. 2 
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The Einstein-Podolsky-Rosen Paradox 

Let x x denote the variables used to describe system I and x 2 those for II. 
The wave function ^ of the combined system I + II can then be expressed, 
for t > T y by the biorthogonal expansion 


= X *M*2)«»(*l) (B.l) 

n = 1 

where u x (x x ) are the eigenfunctions of A x (the operator representing the 
observable a x in I) and tf> k ( x 2 ) describes the state of II if a measurement of 
a x yielded a k , the eigenvalue of A lf belonging to u k (x x ). In the contin¬ 
uous case (B.l) has to be replaced by 

J oo 

^a(*2)«a(*l)<fa (B.2) 

— 00 

where a now denotes the continuous eigenvalues of A x . If the combined 
system I + II is supposed to be composed of two particles whose position 
coordinates are x x and x 2 , respectively, and if 


V(x l9 x 2 ) 



(x x - x 2 + x 0 )p 



(B.3) 


where x 0 is an arbitrary constant, for A x the momentum of particle I may 
be chosen. Since 


( 2m 

u p (x x ) = expi¬ 
re then an eigenfunction of A x corresponding to the eigenvalue p, (B.2) 
shows that 
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4> p (x 2 ) = exp 
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(x 2 - x 0 )p 


(B.5) 


describes the state of II, if a measurement of A x in I yielded the value p. But 
(B. 5) is an eigenfunction of the operator A n = P = (h /2m) d /dx 2 , corre¬ 
sponding to the eigenvalue —poi the momentum of II. 

If, however, the position coordinate of I is chosen as A [, its eigenfunc¬ 
tion is^*! — x), corresponding to the eigenvalue*. Since (B.2) can now 
be written 
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¥Ui,x 2 ) 



(* - x 2 + x 0 )p 


5(Xj —x)dx 


dp (B.6) 


it is clear that 



(x - x 2 + x 0 )p 


dp = hS(x — x 2 + x 0 ) 


(B.7) 


describes the state of II, if a measurement of a[ in I yielded the value x . But 
(B.7) is an eigenfunction of the operator^ ' u =Q = x 2 , corresponding to 
the eigenvalue x + x 0 of the position coordinate of II. The remark that A n 
and A h are noncommutative completes the formal presentation of the 
paradox. For further details and attempts at resolution the reader is re¬ 
ferred to the literature on this subject. 1 
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Appendix C 

When early in 1966 the first edition of The Conceptual Development of 
Quantum Mechanics with its final section on the Einstein-Podolsky-Ro- 
sen paradox was published one could hardly anticipate the decisive impor¬ 
tance of this paradox for the subsequent development of quantum me¬ 
chanics. For it was precisely in 1966 that John Stewart Bell’s celebrated 
paper “On the problem of hidden variables in quantum mechanics” ap¬ 
peared in the July issue of Reviews of Modern Physics. 1 Together with 
Bell’s later written, but earlier published, essay “On the Einstein Podolsky 
Rosen paradox” 2 it marks the beginning of the still presently ongoing 
debate on fundamental issues such as the locality or separability and even 
reality of quantum-mechanical processes and entities. 3 

In view of the by now clearly established historical importance of the 
Einstein-Podolsky-Rosen paper (EPR), we attach to Appendix B this 
postscript, designed to throw some light on the pre-history of the EPR 
paradox. 4 

Although the concept of the completeness (“Vollstandigkeit”) of a 
formalized theory had been made use of in the axiomatization of geometry 
(Moritz Pasch, David Hilbert) and of logic (Emil Leon Post) already in 
the early years of this century and had been casually mentioned also by 
Bohr and Heisenberg in the twenties, the EPR claim of the incompleteness 
of quantum mechanics was the first paper which explicitly defined this 
notion in physics and applied it in a significant way. The very title of the 
EPR paper, “Can quantum-mechanical description of physical reality be 
considered complete?” and its conclusion “that the wave function does 
not provide a complete description of physical reality” seem clearly to 
indicate that this notion was meant to play the dominant role in this paper. 
A closer examination of the text of the paper, of Einstein’s correspondence 
on this matter, and of additional circumstantial facts shows, however, that 
this has not been Einstein’s intention. Thus in a letter to Schrodinger 
Einstein complained that the paper “has not really brought out what I 
actually had in mind, since the main issue is, so to say, buried under 
leamedness.” 5 The letter also implies that Einstein had never seen the 
final draft of the paper before its publication. 

To fully understand the situation it is useful to distinguish between the 
following three components of the paper: (1) the general conceptual 
background and the physical scenario, (2) the thought-experiment in¬ 
volving the two partial systems and its mathematical elaboration, and (3) 
the printed version of the paper as published in The Physical Review. 
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The first component must be attributed to Einstein, the second to Ro¬ 
sen, and the third to Podolsky—or as Professor Rosen expressed it, 
“roughly speaking, one can say that Einstein contributed the general 
point of view and its implications, I found the x/j function, and Podolsky 
composed the paper.” 6 

That, indeed, Einstein contributed the general point of view is also 
confirmed by one of his letters to Paul S. Epstein in which Einstein wrote: 
“I myself came across these ideas, starting from a simple thought-experi¬ 
ment.” 7 Einstein’s approach to the EPR problem had, in fact, its point of 
departure in Bohr’s rebuttal of the well-known photon-box thought-ex¬ 
periment by means of which Einstein had tried to refute the Heisenberg 
time-energy uncertainty relation. 8 Einstein realized that in order to ren¬ 
der his argument immune against Bohr’s recourse to general relativity he 
had to eliminate gravitational effects altogether. Einstein modified there¬ 
fore the experiment and imagined the box with the clock and the shutter, 
containing a quantum of radiation of unknown frequency, to be moveable 
in a horizontal direction along a frictionless rail. This rail is assumed to 
serve as a reference frame with which the box could be rigidly connected 
and at the end of which either an absorbing screen or a reflecting mirror 
could be mounted. Sitting on top of the box, an observer is assumed to 
release the shutter at a precisely determined moment to emit a photon. 

Thereupon the observer can either immediately connect the box with 
the rail, read its position and predict the time of arrival of the photon at the 
screen or he can measure the momentum of the box relative to the refer¬ 
ence frame by means of a Doppler effect with arbitrarily low frequency 
and determine, by means of the well-known recoil formula, the momen¬ 
tum of the photon (or its energy). Einstein pointed out that the light 
quantum, after leaving the box, represents a “real state of affairs” (“einen 
realen Sachverhalt”) which can hardly be thought to depend on what 
kind of measurement is being performed with the box. It follows, so ar¬ 
gued Einstein, that any property of the photon, found by a measurement 
performed with the box, must also exist if the measurement would not 
have been performed at all. In particular, the photon must at every mo¬ 
ment of time possess a definite position as well as a definite momentum, a 
situation not describable in terms of a wave function. 9 

Following a suggestion made by Ehrenfest, 10 Einstein modified the 
box-particle thought-experiment into a two-particle thought-experiment, 
involving a particle of nonzero mass and a photon; assuming an interac¬ 
tion between these two in the past, Einstein showed how, without disturb¬ 
ing the heavy particle, its position or its momentum could be determined 
by measuring the corresponding quantity of the photon. And he conclud¬ 
ed his account by saying: “For this reason I feel myself inclined to ascribe 
objective reality to both these observables.” 



414 


Conceptual Development of Quantum Mechanics 


Since this statement of the assertability of an “objective reality” of a 
particle’s property on grounds of its determinability without disturbing 
the particle obviously expresses the gist of the EPR reality criterion, we 
are on safe ground to assume that the reality criterion as stated in the EPR 
paper has been contributed by Einstein. But such an assertability clearly 
presupposes what has been called “the principle of separability” (“Tren- 
nungsprinzip”), that is the claim that the state of a system is independent 
upon what happens to another system, sufficiently separated in space 
from the former, even if the two systems have been interacting with each 
other in the past. It was the status of this separability principle in quantum 
mechanics which has been occupying Einstein’s mind for a long time. 
Already in 1933, when visiting Brussels, Einstein addressed Leon Rosen- 
feld with the following question: “What would you say of the following 
situation? Suppose two particles are set in motion towards each other with 
the same, very large, momentum, and they interact with each other for a 
very short time when they pass at known positions. Consider now an 
observer who gets hold of one of the particles, far away from the region of 
interaction, and measures its momentum; then, from the conditions of the 
experiment, he will obviously be able to deduce the momentum of the 
other particle. If, however, he chooses to measure the position of the first 
particle, he will be able to tell where the other particle is. This is a perfectly 
correct and straightforward deduction from the principles of quantum 
mechanics; but is it not very paradoxical? How can the final state of the 
second particle be influenced by a measurement performed on the first, 
after all interaction has ceased between them?” 11 

When Einstein in his above-mentioned letter to Schrodinger com¬ 
plained that the EPR paper did not really express “what he actually had in 
mind”, he simply deplored that the published version of the paper almost 
completely ignores the role of the separability principle in quantum me¬ 
chanics. In addition, the letter also reveals that Einstein’s conception of 
the completeness or incompleteness of the quantum-mechanical descrip¬ 
tion did not fully agree with the completeness criterion of the published 
paper. 12 It is therefore reasonable to conclude that the published version 
of the completeness criterion was not proposed by Einstein. 

Concerning Rosen’s contribution to the paper, Rosen himself may be 
quoted: “During a period when discussions of these questions were taking 
place rather actively [during the spring of 1935 the three authors met 
regularly in the morning between 10 and 12 o’clock in Einstein’s office at 
the Institute—M.J. ], it occurred to me one evening that there was this 
function tp describing a state in which the measurement of x x would give x 2 
and the measurement of p x would give p 2i so that measurements on one 
particle would give information about the other. Since the second particle 
was not being disturbed, it appeared that one could assign to it precise 
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values of x and /?, even though the formalism of quantum mechanics was 
not capable of describing such a situation.” 13 Rosen, in other words, 
found precisely a wave function which exactly describes the state of the 
particles in Einstein’s two-particle thought-experiment. 

It is one among those numerous examples which show how Einstein’s 
originally nonmathematical but strongly intuitive reasoning in terms of 
thought-experiments lends itself subsequently to a rigorous mathematiza- 
tion. 

Turning now to the role of Podolsky we can again quote Rosen: “Po¬ 
dolsky did the writing because he liked to use language of logic and was 
good in it.” 14 In this context it should be recalled that in 1934 the distin¬ 
guished logician Kurt Godel visited the Princeton Institute for Advanced 
Study to give there a series of lectures on his, by now famous, incomplete¬ 
ness proofs, namely that the predicate calculus of first order is not com¬ 
plete (in the strong sense) and that sufficiently rich theories such as arith¬ 
metic are incomplete (both in the strong and in the weak sense). It was not 
much later that Einstein, Rosen, and Podolsky decided to write their 
paper on the incompleteness of quantum mechanics. 

Although no documentary evidence seems to be extant which shows 
that Podolsky attended Godel’s lectures, it is highly probable that he did 
so because he was deeply interested in mathematical logic. It is also not 
unreasonable to assume that it was under the influence of these lectures 
that Podolsky committed himself to compose the paper, to stress the in¬ 
completeness claim at the expense of the separability issue, and in particu¬ 
lar to formulate the published version of the completeness criterion. Tex¬ 
tual analysis of the paper and a certain ensuing estrangement in the 
personal relations between Einstein and Podolsky seem to confirm this 
assumption. 
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